REMARKS



I. Status of the Claims 

Claims 45-61 are pending in this application. 

Claim 54 has been amended to recite an "isolated" nucleic acid to distinguish the claimed
nucleic acid molecule from any molecule that might exist naturally in a non-isolated form. Support
for this amendment can be found throughout the specification, including, for example, at page
14, lines 12-21. The specification has also been amended at pages 23-24 to insert the SEQ ID
NOs corresponding to the known sequences deposited at GenBank under the Accession Nos.
AE000159, AE000350, and J01713 and referenced at pages 23-24 of the specification.

This amendment does not add any new matter. 

II. Objection to Specification 

The Office objects to the amendment filed 2 November 2006 under 35 U.S.C. § 132(a),
alleging that it introduces new matter into the disclosure. Office Action at 2. According to the
Office, the "additional matter which is not supported by the original disclosure is as follows:
SEQ ID NOs 14-16." Id. The Office further asserts that if the sequences

are the GenBank Accession Nos. referenced on pages 23 and 24 of the 
specification, in the context of the specification the Accession Nos. 
are used only to indicate where within the Accession Nos the specific 
primers in the PCR anneal, it is not even disclosed that the Accession 
Nos. were the actual PCR template (e.g. page 23, lines 19-26). Hence, 
the only support found for any sequences from these Accession Nos. 
is the inherent support that the PCR primers used in the specification 
anneal at specific regions in the Accession Nos. This does not provide 
support for the entire Accession Nos. which, at best, appear to be what 
is contained in the Sequence Listing. 

Id.

Applicant respectfully traverses this objection. As noted by the Office, SEQ ID NOs 14,
15, and 16 correspond to the sequences deposited at GenBank under the Accession Nos.
AE000159, AE000350, and J01713, respectively. Pages 23-24 of the specification disclose that
individual tRNA genes were isolated by PCR using primers that anneal at specific base pairs in
the GenBank sequences AE000159, AE000350, and J01713. Thus, pages 23-24 teach one way to
isolate E. coli tRNA genes using known nucleic acid sequences deposited and accessible from
GenBank. Applicant's reference to these sequences by their Accession numbers provides an
acceptable shorthand notation for referring to known nucleic acid sequences that complies with
the written description requirement of 35 U.S.C. § 112.

The Office's position that including the sequences of AE000159, AE000350, and J01713
in the sequence listing somehow introduces new matter into the specification because Applicant
did not also set forth in the specification the known nucleotide sequences of AE000159,
AE000350, and J01713 exalts form over substance and is directly at odds with Federal Circuit
precedent on this issue. "Indeed, a requirement that patentees recite known DNA structures, if
one existed, would serve no goal of the written description requirement. It would neither enforce 
the quid pro quo between the patentee and the public by forcing the disclosure of new 
information, nor would it be necessary to demonstrate to a person of ordinary skill in the art that 
the patentee was in possession of the claimed invention." Falko-Gunter Falkner v. Inglis, 448 
F.3d 1357, 1368 (Fed. Cir. 2006). In Falko-Gunter Falkner, the Federal Circuit held that 
"where, as in this case, accessible literature sources clearly provided, as of the relevant date, 
genes and their nucleotide sequences . . ., satisfaction of the written description requirement does 
not require either the recitation or incorporation by reference (where permitted) of such genes 
and sequences." Id. Accordingly, SEQ ID NOs 14-16, corresponding to nucleotide sequences 
referenced in the specification and accessible from literature sources (GenBank; see also Blattner
et al., Science 277(5331):1453-1474 (1997); Nakajima et al., Cell 23(1):239-249 (1981)) as of
the priority date, do not constitute new matter and do not require either the recitation or
incorporation by reference of the specific nucleotide sequences. For these reasons, Applicant
respectfully requests that the Office withdraw the objection to the amendment filed 2 November
2006 under 35 U.S.C. § 132(a).

III. Rejections Under 35 U.S.C. § 112, Second Paragraph

The Office rejects claims 50-52 and 59-61 under 35 U.S.C. § 112, second paragraph, as
allegedly "indefinite for failing to particularly point out and distinctly claim the subject matter
the applicant regards as the invention." Office Action at 3. More specifically, the Office asserts:

Claims 50-52 and 59-61 recite tRNA sequences comprising sequences 
found between specific base pairs of Genbank Accession Nos. The 
specification provides no guidance on which, if any, of the SEQ ID 
NOS in the Sequence Listing corresponds to the Genbank Accession 
Nos. recited in the claims. 

Id. Applicant respectfully traverses this rejection.

Applicant has amended the specification at pages 23 and 24 to indicate that SEQ ID NO: 14
corresponds to AE000159; SEQ ID NO: 15 corresponds to AE000350; and SEQ ID NO: 16
corresponds to J01713.

The Office also asserts that 

because the content of Genbank entries can (and do) change over time 
due to corrections, it cannot be determined if the sequences 
represented by the SEQ ID NOS (if they are the recited Genbank Nos.) 
are the actual DNA sequences as they existed at the time of filing of 
the instant application (applicants claim priority to 1999). Therefore, it 
cannot be determined what the actual tRNA sequences must be in 
order to anticipate the claims, preventing a satisfactory search of the 
claims.

Id.

Applicant acknowledges that GenBank entries can change over time due, for example, to
corrections of the sequence. However, GenBank inserts warnings in its entries to alert readers 
when a sequence has changed as well as interactive links to find the altered sequence. For 
example, the GenBank entry for Accession No. AE000159 indicates that the sequence was 
replaced on 21 June 2004 by gi:48994873. See Attachment A. Thus, the 10,133 base pair 
sequence at Accession No. AE000159, represented by SEQ ID NO: 14, corresponds to the
sequence as submitted and as it existed as of Applicant's priority date. The GenBank entry for
Accession No. AE000350 indicates that the sequence replaced gi: 178900 on 9 September 1997.
See Attachment B. There is no indication that the sequence ever changed after 9 September
1997. Thus, the 10,401 base pair sequence at Accession No. AE000350, represented by SEQ
ID NO: 15, corresponds to the sequence as it existed as of Applicant's priority date. Finally, the
GenBank entry for Accession No. J01713 indicates that the 1100 base pair sequence has not
changed since its submission. See Attachment C. Thus, the 1100 base pair sequence at
Accession No. J01713, represented by SEQ ID NO: 16, corresponds to the sequence as submitted
and as it existed as of Applicant's priority date.

As evidenced by Attachments A-C, as of Applicant's priority date, the sequences
represented by GenBank Accession Nos. AE000159, AE000350, and J01713 are readily
ascertainable and definite. As such, the metes and bounds of the claimed subject matter are clear,
and Applicant respectfully requests that the Office withdraw this 35 U.S.C. § 112, second
paragraph rejection.

IV. Rejections Under 35 U.S.C. § 112, First Paragraph

The Office rejects claims 50-52 and 59-61 under 35 U.S.C. § 112, first paragraph, as
allegedly failing to comply with the written description requirement. Office Action at 3.
Specifically, the Office asserts:

The recitation of tRNA sequences comprising sequences of Genbank 
Accession Nos. in claims 50-52 and 59-61 is an attempt to incorporate 
essential, claimed subject matter by reference to prior art documents 
(i.e. the Genbank entries). A review of the disclosure as filed does not 
reveal these DNA sequences, thus there is no evidence applicants had 
possession of these specific DNA sequences. The incorporation of 
essential material in the specification by reference to an unpublished 
U.S. application, foreign application or patent, or to a publication is 
improper. See MPEP § 608.01(p)(I). Applicant is required to amend
the disclosure to include the material incorporated by reference, if the 
material is relied upon to overcome any objection, rejection, or other 
requirement imposed by the Office. 

Id. at 4. Applicant respectfully traverses this rejection.

As noted above, there is no per se rule that the written description requirement for nucleic
acids can only be satisfied by a recitation of the actual nucleotide sequences in the specification.
In fact, the Federal Circuit has held that requiring an Applicant to recite known DNA structures
is at odds with the goals of the written description requirement:

Indeed, a requirement that patentees recite known DNA structures,
if one existed, would serve no goal of the written description
requirement. It would neither enforce the quid pro quo between the
patentee and the public by forcing the disclosure of new information,
nor would it be necessary to demonstrate to a person of ordinary skill
in the art that the patentee was in possession of the claimed invention.
As we stated in Capon, "[t]he 'written description' requirement states
that the patentee must describe the invention; it does not state that
every invention must be described in the same way. As each field
evolves, the balance also evolves between what is known and what is
added by each inventive contribution." Id. at 1358. Indeed, the forced
recitation of known sequences in patent disclosures would only add
unnecessary bulk to the specification.

Falko-Gunter Falkner, 448 F.3d at 1368. 

The sequences represented by SEQ ID NOs 14-16 were known and accessible from
GenBank and other references as of Applicant's priority date. Applicant refers to these known
sequences in the specification using the shorthand Accession numbers: AE000159, AE000350,
and J01713. The Federal Circuit has held that, as in this case, where "accessible literature
sources clearly provided, as of the relevant date, genes and their nucleotide sequences . . .,
satisfaction of the written description requirement does not require either the recitation or
incorporation by reference (where permitted) of such genes and sequences." Id.; see also Enzo
Biochem, Inc. v. Gen-Probe Inc., 323 F.3d 956, 966 (Fed. Cir. 2002) ("We therefore agree with
Enzo that reference in the specification to deposits of nucleotide sequences describe those 
sequences sufficiently to the public for purposes of meeting the written description 
requirement."). Accordingly, Applicant is not required to recite the actual nucleotide sequences 
of these known nucleic acid molecules in the application; nor is Applicant required to 
incorporate them by reference. Applicant's reference in the application to the Accession 
numbers of these known and deposited sequences adequately satisfies the written description 
requirement. See Falko-Gunter Falkner, 448 F.3d at 1368. Thus, Applicant respectfully
requests the Office to withdraw this written description rejection.

V. Rejections Under 35 U.S.C. § 101

The Office rejects claims 54 and 58 under 35 U.S.C. §101 alleging that the subject matter 
is directed to non-statutory subject matter. Office Action at 10. The Office asserts that the 
"nucleic acid is not recited as purified or isolated, and thus reads on the E. coli genome, a 
product of nature." Id. Applicant has amended claim 54 to recite an "isolated" nucleic acid. 
Claims 55-61 depend directly or indirectly from claim 54. Accordingly, Applicant respectfully 
requests the Office to withdraw this rejection. 

VI. Rejections Under 35 U.S.C. § 103

A. Del Tito in Combination with Nakamura, Zhang, Saier, Kawakami, Clouthier,
and Sprinzl Do Not Render Claims 45-48, 53-56, and 58 Obvious

The Office rejects claims 45-48, 53-56, and 58 under 35 U.S.C. § 103(a) as allegedly
obvious over Del Tito et al. ("Del Tito") in view of Nakamura et al. ("Nakamura"), Zhang et al.
("Zhang"), Saier, Kawakami et al. ("Kawakami"), Clouthier et al. ("Clouthier"), and Sprinzl et
al. ("Sprinzl"). Office Action at 5-9. Applicant respectfully traverses this rejection.

To establish a prima facie case of obviousness, the prior art reference (or references when 
combined) must teach or suggest all the claim limitations. See M.P.E.P. § 2142. Applicant 
submits that the combined teachings of the cited references do not teach all of the elements of the 
rejected claims. 

Claims 45 and 54¹ recite "sequences encoding E. coli argU, ileY, and leuW tRNAs." The
Office acknowledges that Del Tito fails to teach the use of a vector comprising an array of the
three tRNAs: argU, ileY, and leuW. Office Action at 6. However, the Office asserts:

It would have been obvious to one of ordinary skill in the art at the
time of the invention to modify the vector construct taught by Del
Tito et al for compensating for the presence of rarely used codons
present in the gene encoding a protein of interest by interchanging
and/or adding different tRNA genes corresponding to other rarely
used codons in a given cell type, because Del Tito et al teach that it
is within the skill of the art to carefully scrutinize the coding
sequence of a protein, identify rarely used codons and compensate
for the presence of such rarely codons by supplying in trans the
RNA corresponding to the identified rarely used codons from a
vector expressing different tRNA genes, and because the rarely
used codons and corresponding genes were widely known in the
art (i.e. the teachings of Nakamura et al, Zhang et al, Saier, Sprinzl
et al, Kawakami et al and Clouthier et al). One would have been
motivated to do so in order to meet the particular rare-codon
requirements of a gene encoding a desired protein in combination
with a given cell type, and thus receive the expected benefit of
increasing its expression in the given cell type, as taught by Del
Tito et al.

Office Action at 7-8; emphasis added. Thus, the Office asserts that it would have been obvious
to one of skill in the art to add different tRNA genes to the vectors of Del Tito. Applicant
respectfully disagrees.

¹ The remaining claims under rejection depend directly or indirectly from independent
claims 45 and 54.

The ileX gene encodes a tRNA that recognizes the AUA codon, a codon that is rarely 
used in E. coli. To enhance the expression of heterologous proteins in E. coli, Del Tito teaches 
coexpressing the heterologous protein and the ileX gene. Del Tito inserted the ileX DNA into
three pBR322-compatible vectors, one of which (pRI952) was a pACYC184 derivative
containing a second tRNA gene, argU. Del Tito does not teach a vector or nucleic acid 
containing three tRNA genes. Moreover, Del Tito actually teaches away from adding a third 
tRNA gene to vectors for expressing heterologous proteins. 

A prior art reference must be considered in its entirety, i.e., as a whole, including portions
that would lead away from the claimed invention. W.L. Gore & Assoc., Inc. v. Garlock, Inc., 721
F.2d 1540, 220 USPQ 303 (Fed. Cir. 1983). See M.P.E.P. § 2141.02. Del Tito observed less
heterologous protein expression when using the pRI952 vector containing two tRNA genes (ileX
and argU). See Del Tito, page 7089, second column (Table 1). In addition, Del Tito notes that
overexpression of tRNA genes is deleterious to the cell. Id. at 7090, second column, second
paragraph. This is consistent with the specification, which states that "[o]verexpression of tRNA
genes can be deleterious to the host cell, however. Rojiani et al., Proc. Nat. Acad. Sci. U.S.A.
87:1511-1515 (1990); Sharp et al., Nucleic Acids Res. 14:7737-7749 (1986)." Specification,
page 3, lines 12-14. Thus, taken together and considered as a whole, Del Tito in combination
with other art, like Rojiani and Sharp, teaches away from using three or more tRNAs in a host
cell to enhance heterologous gene expression. As indicated in Del Tito, expression of two
tRNAs reduces heterologous protein expression as compared to vectors containing a single tRNA
gene. Furthermore, Del Tito and other art references teach that overexpression of tRNA genes is
deleterious to host cells. Thus, one of skill in the art would not have been motivated to increase
the number and amount of tRNA gene expression because he would have expected such an
increase to have a deleterious effect on the host cell.

Accordingly, the cited references fail to teach or suggest all of the elements of the 
claimed invention. For at least this reason, Applicant submits that the Office has not established 
a prima facie case of obviousness. 

Furthermore, determining whether an invention is obvious from the prior art can require 
more than merely analyzing the differences between the prior art and the claimed invention. The 
Supreme Court has instructed that courts may also evaluate "secondary considerations" of 
obviousness, i.e., other objective factors that may show that an invention was not obvious from
the prior art, even if the differences between the prior art and the claimed invention seem slight.
Graham v. John Deere Co. of Kansas City, 383 U.S. 1, 17-18, 86 S. Ct. 684, 694, 15 L. Ed. 2d
545, 148 USPQ 459, 467 (1966) ("Such secondary considerations as commercial success, long
felt but unsolved needs, failure of others, etc., might be utilized to give light to the circumstances
surrounding the origin of the subject matter sought to be patented. As indicia of obviousness or
nonobviousness, these inquiries may have relevancy."). Indeed, "[i]t is the secondary
considerations that are often the most probative and determinative of the ultimate conclusion of
obviousness or nonobviousness." Pro-Mold and Tool Co. v. Great Lakes Plastics, Inc., 75 F.3d
1568, 1573, 37 USPQ2d 1626, 1630 (Fed. Cir. 1996) (reversing summary judgment of
obviousness because district court failed to consider patentee's commercial success).

Here, the Office must consider the evidence of commercial success submitted in parent
Application No. 09/492,590 (see Attachments D and E), evidence providing an objective
indication of non-obviousness. Specifically, the Rule 132 Declarations report the significant
sales figures of competent cells (BL21-CodonPlus™ RIL (Catalog # 230240); BL21-
CodonPlus™ (DE3)-RIL (Catalog # 230245); and BL21-CodonPlus™ (DE3)-RIL-X (Catalog #
230265); the "RIL products") comprising a vector or nucleic acid having sequences encoding E.
coli argU, ileY, and leuW tRNAs, as recited in the pending claims. As explained in the Rule 132
Declaration, the commercial success of the RIL products is due to the presence of three tRNA
genes (i.e., argU, ileY, and leuW) encoding rarely used codons (see Attachment D at page 3, ¶2),
thereby establishing a nexus between the commercial success and the claimed invention. J. T. 
Eaton & Co. v. Atlantic Paste & Glue Co., 106 F.3d 1563, 1571, 41 USPQ2d 1641, 1647 (Fed. 
Cir. 1997) ("When a patentee can demonstrate commercial success, usually shown by significant 
sales in a relevant market, and that the successful product is the invention disclosed and claimed 
in the patent, it is presumed that the commercial success is due to the patented invention.") 

Accordingly, for the reasons set forth above, Applicant requests that the Office reconsider
and withdraw the 35 U.S.C. § 103 rejection of claims 45-48, 53-56, and 58 based on Del Tito in
combination with Nakamura, Zhang, Saier, Kawakami, Clouthier, and Sprinzl.

B. Del Tito in Combination with Nakamura, Zhang,
Saier, Kawakami, Clouthier, Sprinzl, and Further in
View of Skerra, Do Not Render Claims 49 and 57 Obvious

The Office rejects claims 49 and 57 under 35 U.S.C. § 103(a) as allegedly obvious over
Del Tito, Nakamura, Zhang, Saier, Kawakami, Clouthier, and Sprinzl as applied to claims 45-48,
53-56, and 58 and further in view of U.S. Patent No. 5,849,576 to Skerra et al. ("Skerra").
Office Action at 9-10. Applicant respectfully traverses this rejection.

Applicant submits that the combined teachings of the cited references do not teach all of
the elements of the rejected claims. As noted above, neither Del Tito, Nakamura, Zhang, Saier,
Kawakami, Clouthier, nor Sprinzl teaches or suggests a nucleic acid or vector comprising
sequences encoding E. coli argU, ileY, and leuW tRNAs. Skerra fails to remedy the deficiencies
of Del Tito, Nakamura, Zhang, Saier, Kawakami, Clouthier, and Sprinzl. Accordingly, the cited
references fail to teach or suggest all of the elements of the claimed invention. For at least this
reason, Applicant requests that the Office reconsider and withdraw the 35 U.S.C. § 103 rejection
of claims 49 and 57 based on Del Tito, Nakamura, Zhang, Saier, Kawakami, Clouthier, and
Sprinzl in combination with Skerra.

VII. Conclusion

In view of the foregoing Amendment and Remarks, Applicant submits that this 
application is in condition for allowance. Applicant therefore requests entry of this Amendment, 
the Examiner's reconsideration and reexamination of the application, and the timely allowance
of the pending claims. 






Attorney Docket No.: STG-201-01 
U.S. Application No.: 10/777,010 
Customer No.: 27,495 

Please grant any extensions of time required to enter this paper and charge any additional
required fees to Deposit Account No. 50-3740.

Respectfully submitted,
Carsten-Peter CARSTENS 



Date: 27 December 2007 By: /Timothy B. Donaldson/

Timothy B. Donaldson 
Reg. No. 43,592 



LATIMER, MAYBERRY & MATTHEWS IP LAW, LLP 
13873 Park Center Road 
Suite 122 

Herndon, VA 20171



Tel. 703-463-3073 
Fax. 703-463-3071 



Attachments: 



GenBank Accession No. AE000159 printout from NCBI website (Attachment A) 

GenBank Accession No. AE000350 printout from NCBI website (Attachment B) 

GenBank Accession No. J01713 printout from NCBI website (Attachment C) 

Copy of Rule 132 Declaration of Mary Buchanan from parent Application No. 09/492,590 
dated 8 February 2001 (Attachment D) 

Copy of Rule 132 Declaration of Mary Buchanan from parent Application No. 09/492,590
dated 8 November 2001 (Attachment E) 
AE000159 [gi:1786739] The record has been replaced by U00096



LOCUS       AE000159   10133 bp
DEFINITION  Escherichia coli K12 MG1655
            genome.
ACCESSION   AE000159 U00096
VERSION     AE000159.1  GI:1786739
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 10133)
  AUTHORS   Blattner,F.R., Plunkett,G. III, Bloch,C.A., Perna,N.T., Burland,V.,
            Riley,M., Collado-Vides,J., Glasner,J.D., Rode,C.K., Mayhew,G.F.,
            Gregor,J., Davis,N.W., Kirkpatrick,H.A., Goeden,M.A., Rose,D.J.,
            Mau,B. and Shao,Y.
  TITLE     The complete genome sequence of Escherichia coli K-12
  JOURNAL   Science 277 (5331), 1453-1474 (1997)
  PUBMED    9278503
REFERENCE   2  (bases 1 to 10133)
  AUTHORS   Blattner,F.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-JAN-1997) Guy Plunkett III, Laboratory of Genetics,
            University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA.
            Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax:
            608-263-7459
REFERENCE   3  (bases 1 to 10133)
  AUTHORS   Blattner,F.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-SEP-1997) Guy Plunkett III, Laboratory of Genetics,
            University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA.
            Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax:
            608-263-7459
REFERENCE   4  (bases 1 to 10133)
  AUTHORS   Plunkett,G. III.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-OCT-199J
            Wisconsin, 445 Henry t

[WARNING] On Jun 21, 2004 this sequence was replaced by

This sequence was determined by the E. coli Genome Project at the
University of Wisconsin-Madison (Frederick R. Blattner, director).
Supported by NIH grants HG00301 and HG01428 (from the Human Genome
Project and NCHGR). The entire sequence was independently
determined from E. coli K12 strain MG1655. Predicted open reading
frames were determined using GeneMark software, kindly supplied by
Mark Borodovsky, Georgia Institute of Technology, Atlanta, GA,
30332 [e-mail: mark@amber.gatech.edu]. Open reading frames that
have been correlated with genetic loci are being annotated with CG
Site Nos., unique ID nos. for the genes in the E. coli Genetic
Stock Center (CGSC) database at Yale University, kindly supplied by
Mary Berlyn. A public version of the database is accessible
(http://cgsc.biology.yale.edu). Annotation of the genome is an
ongoing task, whose goal is to make the genome sequence more useful
by correlating it with other data. Comments to the authors are
appreciated. Updated information will be available at the E. coli
Genome Project's World Wide Web site
(http://www.genetics.wisc.edu). *** The E. coli K12 sequence and
its annotations are periodically updated; this is version M54. No
sequence changes. Annotation updates: updated gene identifications



and products; all new functional assignments courtesy of Monica 
Riley; added promoters, protein binding sites, and repeated 
sequences described in reference 1. The unique numeric identifiers 
beginning with a lowercase 'b' assigned to each gene (protein- or 
RNA-encoding) are now designated as gene synonyms instead of 
labels. This should allow them to be searched for in Entrez as gene 

Location/Qualifiers 
source 1..10133
/organism="Escherichia coli K12"
/mol_type="genomic DNA"
/strain="K12"
/sub_strain="MG1655"
/db_xref="taxon:83333"
promoter complement(<1..20)
/note="factor Sigma70; predicted +1 start at 555785"
promoter complement(9..37)
/note="factor Sigma70; predicted +1 start at 555802"
complement(84..317)
/gene="ybcJ"
/note="synonym: b0528"
complement(84..317)
/gene="ybcJ"
/function="orf; Unknown"
/note="f77; 98 pct identical to YBCJ_ECOLI SW: P45571 but
has 7 additional N-ter residues"
/codon_start=1
/transl_table=11
/product="orf, hypothetical protein"
/protein_id="AAC73630.1"
/db_xref="GI:1786740"

/translation="MIHRMSNMATFSLGKHPHVELCDLLKLEGWSESGAQAKIAIAEG 

QVKVDGAVETRKRCKIVAGQTVSFAGHSVQWA" 

complement(298..1164)
/gene="folD"
/note="synonym: b0529"
complement(298..1164)
/gene="folD"
/EC_number="1.5.1.5"
/function="enzyme; Biosynthesis of cofactors, carriers:

promoter
promoter

promoter

99 pct identical

> FOLD_ECOLI SW: P24186"

Folic

/codon_;
/transl_table=11
/product="5,10-methylene-tetrahydrofolate dehydrogenase;
5,10-methylene-tetrahydrofolate cyclohydrolase"
/protein_id="AAC73631.1"
/db_xref="GI:1786741"

/translation="MAAKIIDGKTIAQQVRSEVAQKVQARIAAGLRAPGLAWLVGSN 
PASQIYVASKRKACEEVGFVSRSYDLPETTSEAELLELIDTLNADNTIDGILVQLPLP 
AGIDNVKVLERIHPDKDVDGFHPYNVGRLCQRAPRLRPCTPRGIVTLLERYNIDTFGL 
NAWIGASNIVGRPMSMELLLAGCTTTVTHRFTKNLRHHVENADLLIVAVGKPGFIPG 
DWIKEGAIVIDVGINRLENGKWGDWFEDAAKRASYITPVPGGVGPMTVATLIENTL 
QACVEYHDPQDE" 



complement
..351)
/gene="folD"
/note="factor Sigma70;
complement(397..428)
/gene="folD"
/note="factor Sigma70;
complement(401..429)
/gene="folD"
/note="factor Sigma32;
complement(454..482)
/gene="folD"
/note="factor Sigma70;
1553..1581
/note="factor Sigma70;

predicted +1 start ;

predicted +1 start at 556194"

1602..2177
/gene="sfmA"
/note="synonym: b0530"
1602..2177
/gene="sfmA"
/function="putative
Not classified"
/note="o191; This 191 aa ORF is 66 pct identical (5 gaps)
to 180 residues of an approx. 192 aa protein FM1A_SALTY
SW: P37921"
/codon_start=1
/transl_table=11
/product="putative fimbrial-like protein"
/protein_id="AAC73632.1"
/db_xref="GI:1786742"

/translation="MESINEIEGIYMKLRFISSALAAALFAATGSYAAWDGGTIHFE 

GEL\™AACSVNTDSADQWTLGQyRTDIFNAVGNTSALIPFTIQLNDCDPWAANAAV 

AFSGQADAINDNLLAIASSTNTTTATGVGIEILDNTSAILKPDGNSFSTNQNLIPGTN 

VLHFSARYKGTGTSASAGQANADATFIMRYE" 

232S. .2354 

/note="factor Sigma70; predicted +1 start at 558161"
2397..3089
/gene="sfmC"
/note="synonym: b0531"
2397..3089
/gene="sfmC"
/function="putative factor; Not classified"
/note="o230 was o215; This 230 aa ORF is 62 pct identical
(0 gaps) to an approx. 232 aa protein FIMC_SALTY SW:
P37923"
/codon_start=1
/transl_table=11
/product="putative chaperone"
/protein_id="AAC73633.1"
/db_xref="GI:1786743"

/translation="MMTKIKLLMLIIFYLIISASAHAAGGIALGATRIIYPADAKQTA 

VWIRNSHTNERFLVNSWIENSSGVKEKSFIITPPLFVSEPKSENTLRIIYTGPPLAAD 

RESLFWMNVKTIPSVDKNALNGRNVLQLAILSRMKLFLRPIQLQELPAEAPDTLKFSR 

SGNYINVHNPSPFYVTLVNLQVGSQKLGNAMAAPRVNSQIPLPSGVQGKLKFQTVNDY 

GSVTPVREVNLN" 

3120..5723
/gene="sfmD"
/note="synonym: b0532"
3120..5723
/gene="sfmD"
/function="putative membrane; Not classified"
/note="o867; residues 25-568 are 69 pct identical (2 gaps)
to aa 26-569 from FIMD_SALTY SW: P37924 (870 aa)"
/codon_start=1
/transl_table=11
/product="putative outer membrane protein, export
function"
/protein_id="AAC73634.1"
/db_xref="GI:1786744"

/translation="MKIPTTTDIPQRYTWCLAGICYSSLAILPSFLSYAESYFNPAFL 

LENGTSVADLSRFERGNHQPAGVYRVDLWRNDEFIGSQDIVFESTTENTGDKSGGLMP 

CFNQVLLERIGLNSSAFPELAQQQNNKCINLLKAVPDATINFDFAAMRLNITIPQIAL 

LSSAHGYIPPEEWDEGIPALLLNYNFTGNRGNGNDSYFFSELSGINIGPWRLRNNGSW 

NYFRGNGYHSEQWNNIGTWVQRAIIPLKSELVMGDGNTGSDIFDGVGFRGVRLYSSDN 

MYPDSQQGFAPTVRGIARTAAQLTIRQNGFIIYQSYVSPGAFEITDLHPTSSNGDLDV 

TIDERDGNQQNYTIPYSTVPILQREGRFKFDLTAGDFRSGNSQQSSPFFFQGTALGGL 

PQEFTAYGGTQLSANYTAFLLGLGRNLGNWGAVSLDVTHARSQLADASRHEGDSIRFL 

YAKSMNTFGTNFQLMGYRYSTQGFYTLDDVAYRRMEGYEYDYDGEHRDEPIIVNYHNL 

RFSRKDRLQLNVSQSLNDFGSLYISGTHQKYWNTSDSDTWYQVGYTSSWVGISYSLSF 

SWNESVGIPDNERIVGLNVSVPFNVLTKRRYTRENALDRAYASFNANRNSNGQNSWLA 

GVGGTLLEGHNLSYHVSQGDTSNNGYTGSATANWQAAYGTLGGGYNYDRDQHDVNWQL 

SGGWGHENGITLSQPLGDTNVLIKAPGAGGVRIENQTGILTDWRGYAVMLYATVYRY 

NRIALDTNTMGNSIDVEKNISSWPTQGALVRANFDTRIGVRALITVTQGGKPVPFGS 

LVRENSTGITSMVGDDGQVYLSGAPLSGELLVQWGDGANSRCIAHYVLPKQSLQQAVT 

VISAVCTHPGS" 

5765..6742
/gene="sfmH"
/note="synonym: b0533"
5765..6742
/gene="sfmH"
/function="phenotype; Not classified"
/note="o325; 69 pct identical to FIMH_SALTY SW: P37925
(335aa)"
/codon_start=1
/transl_table=11
/product="involved in fimbrial asembly"
/protein_id="AAC73635.1"
/db_xref="GI:1786745"

/translation="MACLCLANISWATVCANSTGVAEDEHYDLSNIFNSTNNQPGQIV 
VLPEKSGWVGVSAICPPGTLVNYTYRSYVTNFIVQETIDNYKYMQLHDYLLGAMSLVD 
SVMDIQFPPQNYIRMGTDPNVSQNLPFGVMDSRLIFRLKVIRPFINMVEIPRQVMFTV 
YVTSTPYDPLVTPVYTISFGGRVEVPQNCELNAGQIVEFDFGDIGASLFSAAGPGNRP 
AGVMPQTKSIAVKCTNVAAQAYLTMRLEASAVSGQAMVSDNQDLGFIVADQNDTPITP 
NDLNSVIPFRLDAAAAANVTLRAWPISITGQKPTEGPFSALGYLRVDYQ"
6753..7268
/gene="sfmF"
/note="synonym: b0534"
6753..7268
/gene="sfmF"
/function="putative structure; Not classified"
/note="o171; formerly designated ybcG; 57 pct identical (1
gap) to FIMF_SALTY SW: P37926 (172 aa)"
/codon_start=1
/transl_table=11
/product="putative fimbrial-like protein"
/protein_id="AAC73636.1"
/db_xref="GI:1786746"
/translation="MRRVLFSCFCGLLWSSSGWAVDPLGTININLHGNVVDFSCTVNT
ADIDKTVDLGRWPTTQLLNAGDTTALVPFSLRLEGCPPGSVAILFTGTPASDTNLLAL
DDPAMAQTVAIELRNSDRSRLALGEASPTEEVDANGNVTLNFFANYRALASGVRPGVA
KADAIFMINYN"
complement(7271..7903)
/gene="fimZ"
/note="synonym: b0535"
complement(7271..7903)
/gene="fimZ"
/function="putative regulator; Surface structures"
/note="f210; 100 pct identical to FIMZ_ECOLI SW: P21502"
/codon_start=1
/transl_table=11
/product="fimbrial Z protein; probable signal transducer"
/protein_id="AAC73637.1"
/db_xref="GI:1786747"
/translation="MKPTSVIIMDTHPIIRMSIEVLLQKNSELQIVLKTDDYRITIDY
LRTRPVDLIIMDIDLPGTDGFTFLKRIKQIQSTVKVLFLSSKSECFYAGRAIQAGANG
FVSKCNDQNDIFHAVQMILSGYTFFPSETLNYIKSNKCSTNSSTVTVLSNREVTILRY
LVSGLSNKEIADKLLLSNKTVSAHKSNIYGKLGLHSIVELIDYAKLYELI"
complement(7965..7991)
/note="factor Sigma70; predicted +1 start at 563758"
complement(8040..8069)
/note="factor Sigma70; predicted +1 start at 563833"
8146..8222
/gene="argU"
/note="synonym: b0536"
8146..8222
/gene="argU"
/product="tRNA-Arg"
/note="anticodon: UCU; CG Site No. 17755; alternate gene
names dnaY, pin"
/function="RNA; tRNA"

complement(8238..9401)
/gene="intD"
/note="synonym: b0537"
complement(8238..9401)
/function="IS, phage, Tn; Phage-related functions and
prophages"
/note="f387; 100 pct identical to INTD_ECOLI SW: P24218"
/codon_start=1
/transl_table=11
/product="prophage DLP12 integrase"
/protein_id="AAC73638.1"
/db_xref="GI:1786748"
/translation="MSLFRREEIWYASYSLPGGKRIKESLGTKDKRQAQELHDKRKAE
LWRVEKLGDLPDVTFEEACLRWLEEKADKKSLDSDKSRIEFWLEHFEGIRLKDISEAK
IYSAVSRMHNRKTKEIWKQKVQAAIRKGKELPVYEPKPVSTQTKAKHLAMIKAILRAA
ERDWKWLEKAPVIKIPAVRNKRVRWLEKEEAKRLIDECPEPLKSWKFALATGLRKSN
IINLEWQQIDMQRRVAWVNPEESKSNRAIGVALNDTACKVLRDQIGKHHKWVFVHTKA
AKRADGTSTPAVRKMRIDSKTSWLSACRRAGIEDFRFHDLRHTWASWLIQSGVPLSVL
QEMGGWESIEMVRRYAHLAPNHLTEHARKIDDIFGDNVPNMSHSEIMEDIKKA"
9228..9257
/note="factor Sigma70; predicted +1 start at 565064"
9308..9336
/note="factor Sigma70; predicted +1 start at 565143"
9341..9373
/note="factor Sigma70; predicted +1 start at 565180"
9395..9955
/gene="b0538"
9395..9955
/gene="b0538"
/function="putative regulator; Not classified"
/note="o186; This 186 aa ORF is 29 pct identical (4 gaps)
to 57 residues of an approx. 328 aa protein CRJA_TRICY SW:
Q03442"
/codon_start=1
/transl_table=11
/product="putative sensory transduction regulator"
/protein_id="AAC73639.1"
/db_xref="GI:1786749"

/translation="MTFSFLCYAITRAHLDSMQRRLKRRNAGLPCCEVPGYYREDASS

PLLPKYKTNFSQCLVHFFDELRHHLVKTRHVLFIPLNHDIMQAFTLHARVIVGKVPGI

FSRHPHAVLHLGHVSRFYCLETTEPEFHEIPGGTSIAGSVACIRMQSLIRGNRTRITG 

WLPTCTRTCGPMRWLTRNYGARLPMN" 

complement(9443..9471)
/note="factor Sigma70; predicted +1 start at 565236"
complement(9521..9784)
/gene="b0539"
complement(9521..9784)
/gene="b0539"
/function="putative enzyme; Not classified"
/note="f87; This 87 aa ORF is 96 pct identical to
87 residues of a 226 aa protein EXO_LAMBD SW: P03697 but
contains -139 additional N-ter aa"
/codon_start=1
/transl_table=11
/product="putative exonuclease (EC 3.1.11.3) similar to
lambda"
/protein_id="AAC73640.1"
/db_xref="GI:1786750"

/translation="MKFRLGGFEAIKSAYMAQVQYSMWVTRKDAWYFANYDPRMKREG 
LHYWIERNEKYMASFDEMVPEFIEKMDEALAEIGFVFGEQWR" 
promoter complement(9938..9965)
/note="factor Sigma70; predicted +1 start at 565731"

ORIGIN

1 atgctatcag catggatgaa cggggcgtag agggcaaaag tctgaaaaga gaaccggcct 
61 gttgatacag gccgggaaag ggatcaggca acaacctgta cgctgtgacc tgcaaaactc 
121 actgtctgac cggcgacgat tttgcagcgt ttgcgcgttt caaccgcacc gtcgactttc 
181 acctggcctt cggcaatcgc gattttcgcc tgcgcgccgc tttcgctcca gccttccagt 
241 ttcagcaagt cgcacagctc aacgtgcgga tgtttaccta aagaaaatgt cgccatgtta 
301 ctcatcctgt ggatcatgat attcaacgca cgcctgtagc gtgttttcaa tcagcgtggc 
361 aaccgtcatc gggccaacgc cgccgggaac aggcgtaatg tatgaggcgc gtttagccgc 
421 gtcttcaaac acgacgtcgc ccacaacttt gccattttcc agacggttga tgccgacatc 
481 aatcacaatt gcgccttctt tgatccagtc accgggaata aagcctggct tgccaacggc 
541 aacgatcaat agatcggcat tttctacgtg atgacgcaga tttttagtga agcggtgagt 
601 cactgtagtg gtgcaacctg ccagcagcag ttccatgctc atcgggcggc caacgatatt 
661 cgatgcgcca atcaccacgg cgttgaggcc gaaggtatca atgttgtaac gctcaagcag 
721 cgtgacgata ccgcgcgggg tgcagggacg cagacgcggc gcgcgctggc acagacgacc 
781 gacgttgtaa ggatggaaac cgtccacgtc tttgtccgga tgaatacgtt ccagcacttt 
841 gacgttatca atacccgccg gtaacggcag ttgaaccaga atgccatcga tggtgttgtc 
901 ggcattcagc gtatcgataa gctccagcag ctccgcttcg ctggtggttt ccgggaggtc 
961 ataagagcgg gagacgaacc cgacttcttc acaagccttg cgtttgcttg cgacataaat 
1021 ttgcgatgca gggttactac ccaccagcac aacggccagt cctggtgccc gcagtccggc 
1081 tgcaatacgc gcctgaactt tttgagcaac ttcagagcgc acctgctgcg caatcgtttt
1141 accgtcaata atctttgctg ccatcagaga gaggattcca 
1201 ggatgcgcct attttgtcag aagcggggcg cgctgtcagg 
1261 gtgaagcgac ctcttgcgaa ggtgaggcgc accgtcgctg 
atctggaact gatttatttc 
tataaaaaga attttcagcc 
attttgcgcg gaagtattca 
ttcaattaag 
agtaatcggt 
taagattta 



icgggtgt cggtattgaa 

2041 atagcttctc aaccaaccag 

2101 gttataaagg caccggtaca 

ttatgagata tgaataatca 



1981 



2161 1 



tctgttacgt agatcgaagg 

tttcgtttca gatttatcgc 

agactgaaag cttcattttt 

cttgtctaag gattaagata 

tggtaattta ccgcttcagg 

tctaacgggg ctctctattt 

aggataaaaa ggaatttact 

atatacgga tgtggagtcg ataaatgaga 

ttcgtctgc gctggctgcc gcactattcg 

tggcggtac aattcacttt gaaggcgaac 

ctcggcaga ccaggttgtc acactcggtc 

cgatattttc aatgctgttg gtaatacctc tgcattaatt ccattcacca 

cgactgcgat cctgttgttg ccgctaatgc tgccgttgca ttttctggtc 

tcaatgat aatttattgg ccattgcatc cagtaccaat r 



1321 cgtccatgat ggcgttgt. 
1381 atttaagaaa tacctgacaa 
1441 tctatatttg tgttgaatat 
1501 tttagaatag agtgcatatt 
1561 actatcagtg tcttaaataa 
1621 ttgaaggaat atatatgaaa 
1681 ccgctacggg tagttatgct gccgttgtag 
1741 tggtgaatgc tgcctgttca gtgaat. 
1801 aatatcgtai 
1861 ttcagttga. 
1921 aggctgatgi 



itacttgata 

lacttgatcc ccgggaccaa 
igtgcatcag cagggcaagc 
laaccacgtt gttttgaatt 



;atat< 



;tgatggga 
[ tcatgttcac 



2221 aagtaatgta ccggttgtct gaagcggtat ggtggc; 

2281 tttgtatcat gccgctttat taaatgaaaa gggaatgatg tgttgtaag; 

2341 atcatttctt tatattcctt atttttgccg tcaggaatac acaaggcgta ttaactatga 

2401 tgactaaaat aaagttattg atgctcatta tattttattt aatcatttcg gccagcgccc 

2461 atgctgccgg agggatcgca ttaggtgcca cgcgtattat ttatcccgct gatgctaaac 

2521 agactgcggt atggattaga aatagccata ccaatgagcg ctttctggtc aattcgtgga 

2581 ttgaaaacag cagcggtgta aaagaaaagt cattcatcat tacaccgcca ctgtttgtta 

2641 gtgaacccaa aagcgaaaat actttgcgta ttatttacac cggtccaccg ctggcagcag 

2701 atcgtgagtc tctgttctgg atgaatgtta agacgatccc ttcggtagat aaaaatgcat 
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2761 tgaacggcag gaatgttttg caactggcga ttttatcgcg catgaaatta tttctccgtc 
2821 caattcaatt acaagaatta cccgcagaag cgccggacac actcaagttt tcgcgatccg 
2881 gtaactatat caatgttcat aatccatcac ctttttatgt caccctggtt aacttacaag 
2941 tgggcagcca aaagttgggg aatgctatgg ctgcacccag agttaattca caaattccct 
3001 taccctcagg agtgcaggga aagctgaaat ttcagaccgt taatgattat ggttcagtaa 
3061 ctccggtcag agaagtgaac ttaaactaac cgaatcatct gacaatatca gagctaatta 
3121 tgaaaatacc cactactacg gatattccgc agaggtatac ctggtgtctg gccggaattt 
3181 gttattcatc tcttgccatt ttaccctcct ttttaagcta tgcggaaagt tatttcaacc 
3241 cggcattttt attagagaat ggcacatccg ttgctgattt atcgcgcttt gagagaggta 
3301 atcatcaacc tgcgggcgtg tatcgggtgg atctctggcg taatgatgag ttcattggtt 
3361 cgcaggatat cgtatttgaa tcgacaacag aaaatacagg tgataaatca ggtgggttaa 
3421 tgccctgttt taaccaggta cttcttgaac gaattggcct taatagcagt gcatttcccg 
3481 agttagccca gcagcaaaac aataaatgca tcaatttact gaaagctgta cctgatgcca 
3541 caattaactt tgattttgca gcgatgcgcc tgaacatcac tattcctcag atagcgttgt 
3601 tgagtagcgc tcacggttac attccgcctg aagagtggga tgaaggtatt cctgctttac 
3661 tcctgaatta taatttcacc ggtaacagag gtaatggtaa cgatagctat ttttttagtg 
3721 agctcagcgg gattaatatt ggcccgtggc gtttacgcaa caatggttcc tggaactatt 
3781 ttcgcggaaa tggatatcat tcagaacagt ggaataatat tggcacctgg gtacagcgcg 
3841 ccattattcc gctgaaaagt gaactggtaa tgggagacgg caatacagga agtgatattt 
3901 tcgatggcgt tggatttcgt ggtgtacggc tttattcttc tgataatatg tatcctgata 
3961 gccagcaagg gtttgcccca acggtacgtg ggattgcccg tacggcggcc cagctaacga 
4021 ttcggcaaaa tggttttatt atctatcaaa gctatgtttc ccccggcgct tttgaaatta 
4081 cagatttgca cccgacatct tcaaatggcg atctggacgt caccatcgac gagcgcgatg 
4141 gcaatcagca gaattacaca attccgtatt caacagtgcc aattttacaa cgcgaagggc 
4201 gtttcaaatt tgacctgacg gcgggcgatt ttcgtagcgg taatagtcag caatcatcgc 
4261 ctttcttttt tcagggtacg gcactcggcg gtttaccaca ggaatttact gcctacggcg 
4321 ggacgcaatt atctgccaat tacaccgcct ttttattagg gctggggcgc aatctcggga 
4381 actggggcgc agtgtcgctg gatgtaacgc atgcgcgcag tcagttagcc gacgccagtc 
4441 gtcatgaggg ggattctatt cgcttcctct atgcgaaatc gatgaacacc ttcggcacca 
4501 attttcagtt aatgggttac cgctattcga cacaaggttt ttataccctt gatgatgttg 
4561 cgtatcgtcg aatggagggg tacgaatatg attacgacgg tgagcatcgc gatgaaccga 
4621 taatcgtgaa ttaccacaat ttacgcttta gccgtaaaga ccgtttgcag ttaaatgttt 
4681 cacaatcact taatgacttt ggctcgcttt atatttctgg tacccatcaa aaatactgga 
4741 atacttcgga ttcagatacg tggtatcagg tggggtatac cagcagctgg gttggcatca 
4801 gttattcgct ctcattttcg tggaatgaat ctgtagggat ccccgataac gaacgtattg 
4861 tcggacttaa tgtttcagtg cctttcaatg ttttgaccaa acgtcgctac acccgggaaa 
4921 atgcgctcga ccgcgcttat gcctccttta acgccaaccg taacagcaac gggcaaaata 
4981 gctggctggc aggtgtaggt gggaccttac tggaaggcca caacctgagt tatcacgtaa
5041 gccagggtga tacctcgaat aatgggtaca cgggcagcgc cacggcaaac tggcaggccg 
5101 cttacggtac gctggggggc gggtataact acgaccgcga tcaacatgac gttaactggc 
5161 agctgtctgg cggtgtggtc gggcatgaaa atggcataac gctgagccag cctttagggg 
5221 ataccaatgt tttgattaaa gcgcctggcg caggcggtgt acgcattgaa aatcaaactg 
5281 gcattttaac cgactggcgc ggctatgcgg tgatgctgta tgccacggtt tatcggtata 
5341 accgtatcgc gcttgatacc aatacgatgg ggaattccat cgatgttgaa aaaaatatta 
5401 gcagcgttgt gccgacgcaa ggcgcgttgg ttcgtgccaa ttttgatacc cgcataggcg 
5461 tgcgggcgct cattaccgtt acccagggcg gaaaaccggt gccgtttgga tcactggtac 
5521 gggaaaacag taccggaata accagtatgg tgggtgatga cgggcaagtt tatttaagtg 
5581 gtgcgccatt gtctggtgaa ttactggttc agtggggaga cggcgcgaac tcacgctgca 
5641 ttgcgcacta tgtattgccg aagcaaagct tacagcaagc cgtcactgtt atttcggcag 
5701 tttgcacaca tcctggctca taaaggaaat tatcaataag ataatctgca gattattatt 
5761 ggcgatggca tgtttgtgtc tggcaaacat atcctgggct actgtttgtg caaatagtac 
5821 tggcgtagca gaagatgaac actatgatct ctcaaatatc tttaatagca ccaataacca 
5881 gccagggcag attgttgttt taccggaaaa atccggctgg gtaggtgtct cagcaatttg 
5941 tccacccggt acgctggtga attatacata ccgtagttat gtcaccaact ttattgttca 
6001 ggaaactatc gataattata aatatatgca attacatgat tatctattag gtgcgatgag 
6061 tctggttgat agtgtgatgg atattcagtt ccccccgcaa aattatattc ggatgggaac 
6121 agatcctaac gtttcgcaaa accttccatt cggggtgatg gattctcgtt taatatttcg 
6181 tttaaaggtt attcgtccct ttattaacat ggtggagatc cccagacagg tgatgtttac 
6241 cgtgtatgtg acatcaacgc cttacgatcc gttggttaca cctgtttata ccattagttt 
6301 tggtggccgg gttgaagtac cgcaaaactg cgaattaaat gccgggcaga ttgttgaatt 
6361 tgattttggt gatatcggcg catcgttatt tagtgcggca gggccgggta atcgacctgc 
6421 tggtgtcatg ccgcaaacca agagcattgc ggtcaaatgt acgaatgttg ctgcgcaggc 
6481 ttatttaaca atgcgtctgg aagccagtgc cgtttctggt caggcgatgg tgtcggacaa 
6541 tcaggattta ggttttattg tcgccgatca gaacgatacg ccgatcacgc ctaacgatct 
6601 caatagcgtt attcctttcc gtctggatgc agctgcggca gccaatgtca cacttcgcgc 
6661 ctggcctatc agtattaccg gtcaaaaacc gaccgaaggg ccgtttagcg cgctggggta 
6721 tttacgcgtc gattatcaat gaggtacgga gaatgagaag agtactcttt agctgtttct 
6781 gcgggctact gtggagttcc agtggatggg cagttgaccc tttaggaacg attaatatca 
6841 atttgcacgg taacgttgtt gatttctcct gtaccgtaaa cacagcggat attgataaga 
6901 cggtagattt aggcagatgg cctacgacac aactactgaa cgctggcgat accacggcac 
6961 tcgtcccttt tagcctgcgg ctggagggat gtcctccggg ttcagttgcg attttattta 
7021 cgggaacgcc ggcatccgat accaacctgc tggctctgga tgatcccgca atggcacaaa 
7081 ccgtcgccat cgaattacgt aatagcgatc gctcccggct cgcactgggg gaggcgagcc 
7141 cgactgagga agtagatgca aatggcaatg tcacactaaa cttttttgcc aattatcgag 
7201 cgttagccag cggtgttcgg ccaggtgtgg cgaaagcgga tgcgatattt atgatcaatt 
7261 ataattaata ttatattaat tcgtataatt tggcgtagtc gataagctct acaattgaat 
7321 gcaaacctag cttgccataa atattagatt tatgcgcact aactgtttta ttgctaagta 
7381 ataacttatc ggcaatttct ttattagata atccgctaac cagataacgt aatatggtca
7441 cttcacgatt agatagcaca gtgaccgttg aactattcgt actacattta ttgcttttta
7501 tatagttaag cgtttcgctg ggaaaaaacg tgtatccgga gaggatcatc tgaacggcat 
7561 gaaaaatatc attctgatca ttgcatttac tgacaaaacc gttagcacca gcttgtatcg 
7621 ctctgccagc ataaaagcat tctgatttcg atgataaaaa taacactttc actgtgctct 
7681 ggatttgttt gatccttttc aggaaggtaa aaccgtctgt tccgggcaag tctatatcca 
7741 taatgattaa atcaacagga cgggttcgga gataatcgat ggttatgcga taatcatccg 
7801 ttttcaggac aatctgcaat tcactgtttt tttgcaacag aacttcaata gacattctga 
7861 tgataggatg agtatccata atgatcaccg acgttggttt catagttacc agtctcatag
7921 gagcggacaa ttttccgtta ggtcgggaaa ttgtactttg atacatgaaa atacgggttt 
7981 tcttgattca gacgcgcagc ggtgtgcgtt tgtttgccgc tatagcgaaa taaatcagaa 
8041 aatcagacgc ggtcgttcac ttgttcagca accagatcaa aagccattga ctcagcaagg 
8101 gttgaccgta taattcacgc gattacaccg cattgcggta tcaacgcgcc cttagctcag 
8161 ttggatagag caacgacctt ctaagtcgtg ggccgcaggt tcgaatcctg cagggcgcgc 
8221 cattacaatt caatcagtta cgccttcttt atatcctcca taatttcaga gtgggacata 
8281 tttgggacat tatcaccaaa aatgtcgtct attttcctcg catgctctgt caaatgatta 
8341 ggcgcaaggt gagcatacct acgaaccatt tctatggact cccatccgcc catttcctga 
8401 agcactgata atgggacgcc tgactgaatc agccagcttg cccaggtgtg tctgaggtca 
8461 tggaaacgga aatcttcaat tcctgcacga cgacaagctg atagccatga tgtcttgctg 
8521 tcgatgcgca tcttcctgac cgcaggcgtt gatgttccat ctgctcgctt agccgccttg 
8581 gtatgtacaa acacccattt gtgatgcttg cctatttgat cacgcaacac tttacaggcg 
8641 gtatcgttca gcgccacacc aatggcgcgg tttgatttgc tctcttctgg attcacccag 
8701 gcaactcgtc gctgcatgtc gatttgttgc cattccagat ttatgatgtt cgactttctc 
8761 agaccagttg ccagcgcaaa cttgacgaca gatttcagtg gttcggggca ctcatcaata 
8821 aggcgttttg cttcctcctt ttccagccat ctgactcgct tgtttctgac cgctggtatc 
8881 ttgatgacag gcgctttttc cagccacttc cagtcgcgtt ctgcagcacg gagaatggcc 
8941 tttatcatgg caagatgctt tgcctttgtc tgagttgata ctggctttgg ttcataaaca 
9001 ggcagttctt tacctttcct gatggcggcc tgaactttct gtttccatat ttctttcgtc 
9061 tttctgttat gcattctgct tacagcagag taaatctttg cctccgagat atctttaagc 
9121 cttataccct caaaatgttc aagccagaac tcaatccggc ttttatctga atcgagagat 
9181 tttttatcag ctttttcctc aagccatctt aggcaggcct cttcaaaagt gacatcaggt 
9241 aaatccccta gcttttctac tcgccagagt tctgcttttc gcttgtcgtg caactcctga 
9301 gcttgccgtt tgtcctttgt gccaagagat tccttaattc gtttcccgcc cgggagcgaa 
9361 tacgaggcat accatatttc atttctgcgg aagagtgaca ttttctttcc tctgttatgc 
9421 catcacccgc gctcacctgg acagtatgca gcggagactg aagcgccgca atgcaggctt 
9481 gccgtgttgt gaggtacccg gatattatcg tgaggatgcg tcatcgccat tgctccccaa 
9541 atacaaaacc aatttcagcc agtgcctcgt ccattttttc gatgaactcc ggcaccatct 
9601 cgtcaaaact cgccatgtac ttttcattcc gctcaatcac gacataatgc aggccttcac 
9661 gcttcatgcg cgggtcatag ttggcaaagt accaggcatc ttttcgcgtc acccacatgc 
9721 tgtactgcac ctgggccatg taagccgatt ttattgcctc gaaaccaccg agccggaatt 
9781 tcatgaaatc ccgggaggta cgagtattgc cggaagcgtg gcctgtatcc ggatgcagag 
9841 tcttatccgt ggaaatcgaa cgcgcattac tggttggtta ccaacttgta ccagaacatg 
9901 cgggccaatg cgctggctga cgcggaatta cggcgcaagg ctgccgatga actgacctgt 
9961 atgacagcgc gaattaaccg tggtgagacg atacctgaac cagtaaaaca acttcctgtt 
10021 atgggcggta gacctctaaa tcgtgcacag gctctggcga agatcgcaga aattaaagct 
10081 aagttcggac tgaaaggagc aagtgtatga cgggcaaaga ggcaattatt cat

1: AE000350. Reports ... [gi:2367147] The record has been replaced by U00096



LOCUS       AE000350 10401 bp DNA linear BCT 01-DEC-2000
DEFINITION  Escherichia coli K12 MG1655 section 240 of 400 of the complete
            genome.
ACCESSION   AE000350 U00096
VERSION     AE000350.1 GI:2367147
KEYWORDS
SOURCE      Escherichia coli K12
  ORGANISM  Escherichia coli K12
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1 (bases 1 to 10401)
  AUTHORS   Blattner,F.R., Plunkett,G. III, Bloch,C.A., Perna,N.T., Burland,V.,
            Riley,M., Collado-Vides,J., Glasner,J.D., Rode,C.K., Mayhew,G.F.,
            Gregor,J., Davis,N.W., Kirkpatrick,H.A., Goeden,M.A., Rose,D.J.,
            Mau,B. and Shao,Y.
  TITLE     The complete genome sequence of Escherichia coli K-12
  JOURNAL   Science 277 (5331), 1453-1474 (1997)
  PUBMED    9278503
REFERENCE   2 (bases 1 to 10401)
  AUTHORS   Blattner,F.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-JAN-1997) Guy Plunkett III, Laboratory of Genetics,
            University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA.
            Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax:
            608-263-7459
REFERENCE   3 (bases 1 to 10401)
  AUTHORS   Blattner,F.R.
  TITLE     Direct Submission
  JOURNAL   Submitted (02-SEP-1997) Guy Plunkett III, Laboratory of Genetics,
            University of Wisconsin, 445 Henry Mall, Madison, WI 53706, USA.
            Email: ecoli@genetics.wisc.edu Phone: 608-262-2534 Fax:
            608-263-7459
REFERENCE   4 (bases 1 to 10401)
  AUTHORS   Plunkett,G. III.
  TITLE     Direct Submission
  JOURNAL   Submitted (13-OCT-1998) Laboratory of Genetics, University of
            Wisconsin, 445 Henry Mall, Madison, WI 53706, USA
COMMENT     On Sep 9, 1997 this sequence version replaced gi:1789000.
            This sequence was determined by the E. coli Genome Project at the
            University of Wisconsin-Madison (Frederick R. Blattner, director).
            Supported by NIH grants HG00301 and HG01428 (from the Human Genome
            Project and NCHGR). The entire sequence was independently
            determined from E. coli K12 strain MG1655. Predicted open reading
            frames were determined using GeneMark software, kindly supplied by
            Mark Borodovsky, Georgia Institute of Technology, Atlanta, GA,
            30332 [e-mail: mark@amber.gatech.edu]. Open reading frames that
            have been correlated with genetic loci are being annotated with CG
            Site Nos., unique ID nos. for the genes in the E. coli Genetic
            Stock Center (CGSC) database at Yale University, kindly supplied by
            Mary Berlyn. A public version of the database is accessible
            (http://cgsc.biology.yale.edu). Annotation of the genome is an
            ongoing task whose goal is to make the genome sequence more useful
            by correlating it with other data. Comments to the authors are
            appreciated. Updated information will be available at the E. coli
            Genome Project's World Wide Web site
            (http://www.genetics.wisc.edu). *** The E. coli K12 sequence and
            its annotations are periodically updated; this is version M54. No
            sequence changes. Annotation updates: updated gene identifications
            and products; all new functional assignments courtesy of Monica
            Riley; added promoters, protein binding sites, and repeated
            sequences described in Reference 1. The unique numeric identifiers
            beginning with a lowercase 'b' assigned to each gene (protein- or
            RNA-encoding) are designated as gene synonyms instead of
            [illegible]; this should allow them to be searched for in
            Entrez as gene [illegible]

Location/Qualifiers 
1..10401
/organism="Escherichia coli K12"
/mol_type="genomic DNA"
/strain="K12"
/sub_strain="MG1655"
/db_xref="taxon:83333"
complement(212..4921)
/gene="ypjA"
/note="synonym: b2647"
complement(212..4921)
/gene="ypjA"
/function="putative transport; Not classified"
/note="f1569; This 1569 aa ORF is 42 pct identical (65
gaps) to 883 residues of an approx. 864 aa protein
YEJO_ECOLI SW: P33924; was ORF_f949 of GenBank Accession
Number U36840"
/codon_start=1
/transl_table=11
/product="putative ATP-binding component of a transport
system"
/protein_id="AAC75695.1"
/db_xref="GI:1789001"

/translation="MNTIHLRCLFRMNPLVWCLWADVAAKLRSLKRYSVFTFQRMKFM 

NRTSPYYCRRSVLSLLISALIYAPPGMAAFTTNVIGWNDETVDGNQKVDERGTTNNT 

HIINHGQQNVHGGVSNGSLIESGGYQDIGSHNNFVGQANNTTINGGRQSIHDGGISTG 

TTIESGNQDVYKGGISNGTTIKGGASRVEGGSANGILIDGGSQIVKVQGHADGTTINK 

SGSQDWQGSLATNTTINGGRQYVEQSTVETTTIKNGGEQRVYESRALDTTIEGGTQS 

LNSKSTAKNTHIYSGGTQIVDNTSTSDVIEVYSGGVLDVRGGTATNVTQHDGAILKTN 

TNGTTVSGTNSEGAFSIHNHVADNVLLENGGHLDINAYGSANKTIIKDKGTMSVLTNA 

KADATRIDNGGVMDVAGNATNTIINGGTQNINNYGIATGTNINSGTQNIKSGGKADTT 

IISSGSRQWEKDGTAIGSNISAGGSLIVYTGGIAHGVNQETGSALVANTGAGTDIEG 

YNKLSHFTITGGEANYWLENTGELTWAKTSAKNTTIDTGGKLIVQKEAKTDSTRLN 

NGGVLEVQDGGEAKHVEQQSGGALIASTTSGTLIEGTNSYGDAFYIRNSEAKNWLEN 

AGSLTWTGSRAVDTIINANGKMDVYGKDVGTVLNSAGTQTIYASATSDKANIKGGKQ 

TVYGLATEANIESGEQIVDGGSTEKTHINGGTQTVQNYGKAINTDIVSGLQQIMANGT 

AEGSIINGGSQWNEGGLAENSVLNDGGTLDVREKGSATGIQQSSQGALVATTRATRV 

TGTRADGVAFSIEQGAANNILLANGGVITVESDTSSDKTQVNMGGREIVKTKATATGT 

TLTGGEQIVEGVANETTINDGGIQTVSANGEAIKTKINEGGTLTVNDNGKATDIVQNS 

GAALQTSTANGIEISGTHQYGTFSISGNLATNMLLENGGNLLVLAGTEARDSTVGKGG 

AMQNLGQDSATKVNSGGQYTLGRSKDEFQALARAEDLQVAGGTAIVYAGTLADASVSG 

ATGSLSLMTPRDNVTPVKLEGAVRITDSATLTLGNGVDTTLADLTAASRGSVWLNSNN 

SCAGTSNCEYRVNSLLLNDGDVYLSAQTAAPATTNGIYNTLTTNELSGSGNFYLHTNV 

AGSRGDQLWNNNATGNFKIFVQDTGVSPQSDDAMTLVKTGGGDASFTLGNTGGFVDL 

GTYEYVLKSDGNSNWNLTNDVKPNPDPIPNPKPDPKPDPKPDPNPKPDPTPDPTPTPV 

PEKRITPSTAAVLNMAATLPLVFDAELNSIRERLNIMKASPHNNNVWGATYNTRNNVT 

TDAGAGFEQTLTGMTVGIDSRNDIPEGITTLGAFMGYSHSHIGFDRGGHGSVGSYSLG 

GYASWEHESGFYLDGWKLNRFKSNVAGKMSSGGAANGSYHSNGLGGHIETGMRFTDG 

NWNLTPYASLTGFTADNPEYHLSNGMKSKSVDTRSIYRELGATLSYNMRLGNGMEVEP 

WLKAAVRKEFVDDNRVKVNSDGNFVNYLSGRRGIYQAGIKASFSSTLSGHLGVGYSHS 

AGVESPWNAVAGVNWSF"

complement(4939..4968)
/note="factor Sigma70; predicted +1 start at 2780887"
complement(4950..4976)
/note="factor Sigma70; predicted +1 start at 2780898"
complement(4997..5024)
/note="factor Sigma70; predicted +1 start at 2780945"
complement(5029..5058)
/note="factor Sigma70; predicted +1 start at 2780977"
complement(5058..5086)
/note="factor Sigma70; predicted +1 start at 2781006"
complement(5074..5103)
/note="factor Sigma70; predicted +1 start at 2781022"
complement(5130..5273)
/gene="b2648"
complement(5130..5273)
/gene="b2648"
/function="orf; Unknown"
/note="f47; This 47 aa ORF is 76 pct identical [illegible]
to 25 residues of an approx. 192 aa protein [illegible]
P03014; phage stats"
/codon_start=1
/transl_table=11
/product="orf, hypothetical protein"
/protein_id="AAC75696.1"
/db_xref="GI:1789002"

/translation="MWHLWLLEELCERGINFRALAQSIFAQQWGDECCKSKTICDLK 
VIV" 



complement(5422..5453)
/note="factor Sigma70; predicted +1 start at [illegible]"
complement(5703..6494)
/gene="b2649"
complement(5703..6494)
/gene="b2649"
/function="orf; Unknown"
/note="f263; This 263 aa ORF is 26 pct identical (20 gaps)
to 167 residues of an approx. 1640 aa protein MSAP_PLAFW
SW: P04933; phagestats"
/codon_start=1
/transl_table=11
/product="orf, hypothetical protein"
/protein_id="AAC75697.1"
/db_xref="GI:1789003"
/translation="MESRNSYENKIDEISSLSESKEHPIDIQEKKDAFVNEFKGVLFD
KNTRSSELLFNFYECCYKFLPRAQPQDKIDSYNSALQAFSIFCSSTLTHNNIGFDFKL
FPEVKLSGEHLETVFKYKNGDDVREIAKINITLQKEEGGLYNLRGLDFKGCFFSGQNF
SNYDIQYVNWGTSLFDVDTPCIFNAPAYNKSNEKSLKPVSENGLSGVLTDRNNKIKLI
TGVAPFDDILFMDDDFDDSSSEDDPVENSPWTSPWSSSKSSFQ"
complement(6523..6552)
/note="factor Sigma70; predicted +1 start at 2782471"
complement(6594..7076)
/gene="b2650"
complement(6594..7076)
/gene="b2650"
/function="orf; Unknown"
/note="f160"
/codon_start=1
/transl_table=11
/product="orf, hypothetical protein"
/protein_id="AAC75698.1"
/db_xref="GI:1789004"

/translation="MLVSKSNGFNASAVLGSGSYNENKSSKHMELLAHSILKLICKEA 
ASETYRGALETLQKMMSECIYQEGNAFVIMGAGEQLKRIKYEVGENNLKVFNVHFNNN 
HELVSSGEPDVICLSKQWENLLIKLKLENNENVFSETKKLSNKNNADQFFECAKRNE" 
complement(7134..7161)
/note="factor Sigma70; predicted +1 start at 2783082"
7182..7210
/note="factor Sigma70; predicted +1 start at 2783172"
7286..7417
/gene="b2651"
7286..7417
/gene="b2651"
/function="orf; [illegible]"
/note="o43; C-t[illegible] differs from earlier [illegible]"
/codon_start=1
/transl_table=11
/product="orf, hypothetical protein"
/protein_id="AAC75699.1"
/db_xref="GI:2367148"
/translation="MKIVFQVLEIYISIALVNEHSHAPVTLTKLICLRRVFWSWDV"
complement(7827..7902)
/gene="ileY"
/note="synonym: b2652"
complement(7827..7902)
/gene="ileY"
/product="tRNA-Ile"
/note="anticodon: CAU; [illegible]
C6:G67 base pair in amin[illegible]"
/function="RNA; tRNA"
complement(7865..8038)
/gene="b2653"
complement(7865..8038)
/gene="b2653"
/function="orf; Unknown"
/note="f57; This 57 aa ORF is [illegible]
to 37 residues of an approx. 544 aa [illegible]
P98055"
/codon_start=1
/transl_table=11
/product="orf, hypothetical protein"
/protein_id="AAC75700.1"
/db_xref="GI:1789006"

/translation="MEAVSGFMSRVITTQGEWERIKICYLHHMIIMFIAWFISIVIAH 

SRPFSSWRAGDS" 

complement(8183..8211)
/note="factor Sigma70; predicted +1 start at 2784131"
8316..8344
/note="factor Sigma70; predicted +1 start at 2784306"
8462..8794
/gene="b2654"
8462..8794
/gene="b2654"
/function="orf; Unknown"
/note="o110; phage stats; This 110 aa ORF is 31 pct
identical (2 gaps) to 83 residues of an approx. 384 aa
protein RMAR_WILMR SW:P47906"
/codon_start=1
/transl_table=11
/product="orf, hypothetical protein"
/protein_id="AAC75701.1"
/db_xref="GI:1789007"

/translation="MFSIKPGPRNLPIDNPTLLSWNITDGDLNSKLNTLEYLNCITNI 

INSCGVYPQGLKDREIISTFHAEKVINDLLKNDYKISLSPDTTYRELNKAAQRSITAP 

DRIGERKT" 

8813..9307
/gene="b2655"
8813..9307
/gene="b2655"
/function="orf; Unknown"
/note="o164; o164; phage stats; This 164 aa ORF is 30 pct
identical (4 gaps) to 73 residues of an approx. 488 aa
protein YJQ2_YEAST SW:P46997"
/codon_start=1
/transl_table=11
/product="orf, hypothetical protein"
/protein_id="AAC75702.1"
/db_xref="GI:1789008"

/translation="MIERGDNSGVYQYGRAEHFTHIISDKPSPKDKYVAYAINIPDYE 
LAADVYNINVTSPSGQQETFKILINLEHLRQTLERKSLTAVQKSQCEIITPKKPGEAI 
LHAFNATYQQIRENMSEFARCHYGYIQIPPVTTFRVRRTRNSRRRKGLLVSCLSTRRS 
LYHP" 

9312..9500
/gene="b2656"
9312..9500
/gene="b2656"
/function="orf; Unknown"
/note="o62; o62; phage stats; This 62 aa ORF is 29 pct
identical (0 gaps) to 47 residues of an approx. 472 aa
protein AMY1_AERHY SW:P22630"
/codon_start=1
/transl_table=11
/product="orf, hypothetical protein"
/protein_id="AAC75703.1"
/db_xref="GI:1789009"

/translation="MGDLQDFIALVKDAKKFGIDIIPDYTFNFMGIGGSGKNDLDYPS 

ADIRAKISKDIEGGIPGY" 

9379..9396
/gene="b2656"
/note="central position to predicted promoter: -233"
/bound_moiety="PhoB predicted site"
9583..9615
/note="factor Sigma70; predicted +1 start at 2785577"
9672..10304
/gene="b2657"
9672..10304
/gene="b2657"
/function="putative enzyme; Not classified"
/note="o210; o210; phage stats; This 210 aa ORF is 28 pct
identical (19 gaps) to 187 residues of an approx. 472 aa
protein AMY1_AERHY SW:P22630"
/codon_start=1
/transl_table=11
/product="putative enzyme"
/protein_id="AAC75704.1"
/db_xref="GI:1789010"

/translation="MPKTTPKSDQVIMLQNQYVREMRKYGVRGLRYDAAKHSKHEQIE 
RSITPPLKNYNERLHNTNLFNPKYHKKAVMNYMEYLVTCQLDEQQMSSLLYERDDLSA 
IDFSLLMKTIKAFSFGGDLQTLASKPGSTISSIPSERRILININHDFPNNGNLFNDFL
FNHQQDEQLAMAYIAALPFSRPLVYWDGQVLKSTTEIKNYDGSTRVGGEA"
promoter 10315..10342
/note="factor Sigma70; predicted +1 start at 2786304"
promoter 10401..>10401
/note="factor Sigma70; predicted +1 start at 2786391"

ORIGIN

1 atcaatttaa caacaagtta cacaaccaat tcagactccg ccagcccacc aatcatgatt 
61 ggacggtgta aggacaacac caacaaaaac aggaagttag aagtctcagc aaaacaccga 
121 ccagacggtg aggagacata aaaggatacg caaaggagcc gcggctcctg gtaacatgaa 
181 agcccacaga tgtgggcttt ttcgttgatg gtcagaacga ccagttcaca ccagctaccg 
241 cgttccacgg ggattccaca ccggcactat ggctataccc caccccaaga tgcccgctta 
301 acgtactgct gaatgaggct ttaatacctg cctggtatat tccacgtctg cccgacaaat 
361 aattgacgaa attaccgtca ctattcactt tcacccggtt atcatcgaca aattctttgc 
421 gcacagccgc cttcagccac ggctcaactt ccataccgtt ccccagacgc atgttgtaac 
481 tcagcgttgc gcccagttca cgatatatac tgcgggtatc gactgatttc gatttcatgc 
541 cattggataa atgatattcg gggttatcag cggtgaaccc cgttaacgat gcatacggcg 
601 tcaggttcca gttaccatcg gtaaatcgca tcccggtttc aatgtgaccg cccagcccgt 
661 tgctgtggta actgccattg gcggctccac cgctgctcat tttacctgct acgttacttt 
721 taaaacggtt cagcttcacg acaccgtcca gatagaaacc actttcatgt tcccaactgg 
781 catagccgcc cagagaataa ctgcccacac tgccatgtcc tccgcgatca aaaccgatat 
841 gtgaatggga atagcccata aaagcgccta gcgtggtaat tccttcagga atatcattac 
901 ggctgtcgat ccccactgtc attccggtca gcgtctgctc aaacccggca cccgcatcgg 
961 tggtgacatt attacgggtg ttatacgtcg ccccccagac attattgttg tgtggactcg 

1021 ctttcattat gttcaaccgc tcgcgaatac tgtttagctc agcatcaaat accaacggta 
1081 atgttgctgc catattgagt acggctgccg tagaaggcgt aatgcgtttc tccggaacgg 
1141 gtgtcggcgt tggatcggga gtagggtctg gttttggatt tgggtctggc tttggatcgg 
1201 gttttgggtc tggctttgga tttgggatgg ggtccgggtt gggtttgaca tcattggtca 
1261 ggttccagtt gctgttgccg tcacttttca ggacatactc ataggtccca agatcaacga 
1321 aaccgccggt attgcccagc gtaaacgaag catcccctcc ccctgttttc accagcgtca 
1381 tcgcgtcgtc agactgtggg ctgacgccgg tatcctgaac aaagatttta aaattaccag 
1441 tggcgttgtt gttgacgacc agttgatcgc cccgggagcc tgcaacgttg gtatgcaggt 
1501 agaaattacc gctaccggaa agttcattgg ttgtcagcgt attgtagata ccgttagttg 
1561 tggcaggcgc tgctgtttgt gctgacaaat aaacatcacc gtcgttgagt agcaaactgt 
1621 ttactctata ttcgcagttg ctggtacctg cacaggaatt attgctgtta agccagacac 
1681 tgccccggct ggcagccgtc aggtcggcaa gcgtggtatc gacgccattt cccagagtca 
1741 atgtcgcgct atcggtaatc cggaccgccc cttcgagttt aactggcgta acattatccc 
1801 gtggcgtcat taacgacagg cttcctgtcg cgccactgac cgatgcatcc gccagcgtac 
1861 ctgcgtagac gattgccgta ccgccagcga cctggagatc ttctgcccgg gccagagcct 
1921 gaaactcatc ttttgaccgc ccaagggtat attgcccgcc agagttaacc tttgtggcgg 
1981 agtcctgacc caggttttgc attgccccac ccttgccaac cgtggagtcg cgagcttcgg 
2041 tacctgctaa taccaataaa ttaccgccat tttccagcaa catattggtc gctaaattgc 
2101 cggaaatgga aaaagtaccg tactggtgag taccgctgat ttcaataccg ttagccgtgc 
2161 tcgtctggag agcggcaccg ctgttctgga cgatatctgt cgctttgcca ttatcgttga 
2221 ctgtcagcgt accgccttca ttgatctttg tttttattgc ctctccgtta gctgaaactg 
2281 tttgtattcc gccgtcgtta attgttgtct cattcgccac accctcgaca atttgttcac 
2341 cgccggtgag cgtcgtgcct gtcgcagtgg cttttgtttt gacgatctcc cgtccgccca 
2401 tattgacctg tgttttgtca gaagaggtgt ctgactccac ggttaacacg ccgccatttg 
2461 ccagcaggat attgttcgcc gcaccctgct cgatgctgaa cgcgacgcca tccgcgcgtg 
2521 ttcctgtgac ccgcgtcgcc ctggtggttg caaccaaagc gccctggcta ctctgctgta 
2581 tccccgttgc gctgcctttc tcccgcacat cgagtgtgcc gccgtcatta agcaccgagt 
2641 tttcagccag gccgccctca ttaactacct gtgaaccccc attaataatg gaaccttccg 
2701 ctgtcccgtt tgccataatt tgttgtaggc cagagacgat atcggtattg attgccttac 
2761 cataattctg aacggtttgc gtgccaccat tgatgtgtgt tttctctgtt gacccaccat 
2821 caacaatttg ttcaccactt tcgatatttg cttcagtggc taaaccatat accgtttgct 
2881 tgccaccttt gatatttgct ttatcagaag tggcactggc atatattgtt tgggtgccag 
2941 cactattgag tacagtgcca acatcttttc cataaacatc cattttgccg ttggcattaa 
3001 taatcgtgtc aactgcgcgg gaaccagtga cgactgttaa tgagccagcg ttttccagca 
3061 ctacattttt agcttctgaa ttcctgatgt agaaagcatc accataactg ttggttcctt 
3121 cgataagtgt tccggaggtc gtggaagcaa ttaatgcgcc gccggattgt tgctcaacat 
3181 gcttagcctc accaccgtcc tgaacctcca gaacgccgcc attattaagt ctggtgctat 
3241 ctgttttagc ctccttctgg acaatcagct taccgccagt atcaatggta gtatttttcg 
3301 ccgaggtttt agccactacc gtcagttcgc cggtattttc cagcacaaca taattagcct 
3361 cccctccggt aatagtgaag tgagagagct tgttgtatcc ttcgatatca gtccctgcac 
3421 ccgtgttggc aactaaagca ctgcccgtct cctggttaac cccatgtgca ataccgccgg 
3481 tatagacaat cagcgagcct ccggcgctaa tattgctgcc aattgccgta ccatctttct 
3541 caacaacctg ccggctcccg gaggatataa ttgttgtgtc agctttcccg ccgcttttga 
3601 tattttgcgt tccgctgttg atattggtgc ctgtggctat gccataatta ttaatattct 
3661 gtgtgccacc attaattatg gtatttgtcg cgtttcctgc aacatccata accccgccat 
3721 tatctattcg ggtcgcatca gctttagcat tggttaaaac tgacattgtt cctttatctt 
3781 taataatcgt cttgtttgcc gaaccatatg cgtttatgtc taaatgacca ccgttttcca 
3841 gcaacacatt gtctgccacg tgattgtgga tggagaatgc accttcacta ttcgtaccgc 
3901 tcaccgtcgt accgttagtg ttagttttta aaattgcacc atcgtgctgg gtaacatttg 
3961 ttgccgtacc acccctaaca tcaagcacgc caccagaata aacttcaata acatccgagg 
4021 tgctggtgtt atcaacaatt tgcgtgccac cagaatagat atgcgtattt tttgccgttg 
4081 acttactatt cagagactga gttccgcctt caatcgtogt gtccagcgca cggctctcat 
4141 atactctttg ctcaccgcca tttttaatgg ttgttgtttc tactgtgctc tgttcaacat 
4201 actgtcgacc accatttatg gttgtgttcg ttgccagact tccttgtact acgtcctgag 
4261 agccagactt atttatcgtt gtaccatcag catgcccttg aacttttact atctggctac 
4321 caccatcaat gagtattcca ttcgcactcc 
4381 ttgtcgttcc attgctgata ccccctttat 
4441 tacctgtgga aatacccccg tcatgaattg 
4501 tagcctgccc cacaaaattg ttatgacttc 
4561 gacttccatt agataccccg ccatgaacat 
4621 tatttgttgt acctcgttca tccacttttt 
4681 caccaataac attagtagtg aaggcagcca 
4741 tcaataagga aagtactgag cggcgacaat 
4801 tcctctgaaa agtgaatact gagtagcgtt 
4861 cccacaggca ccagaccagg gga' 



ctccctctac gcgtgaagca ccgcccttaa 
aaacgtcctg attgccactc tcgattgtcg 
actgtctgco accgttaatg gttgtattat 
ctatatcttg atatccacca gattcaataa 
tctgctggcc atggttgata atatgagtgt 
ggttgccatc tacagtctca tcgtttacca 
tcccgggcgg ggcatatatc aaggcagata 
aatagggact ggtcctgttc ataaatttca 
taagcgacct tagctttgct gcaacatcag 
tgaagagaca gcgcaagtgt attgtgttc; 



gatgaaatga tgatattccg cataagaatg aggcattttt 

4981 taaacgcagt gcgctgaagt gtggttggat aaaaaagtca atccattcag gaaatacggg 

5041 cgtattcttt tctttcgaca atgagccgtt gccaaaataa aatgatttac ataatcgttt 

5101 ctgatgaata tcttctgctc acataaaaat cacacaataa ctttgagatc gcagattgtt 

5161 ttacttttac agcattcgtc cccccattgt tgggcaaata tagattgggc cagagcacga 



5221 aagttaatac cacgttcg( 
5281 cccagtcggt tcagtttai 
5341 tttttcagtc egg; 
5401 ttcttcagtc aggcgtggtg 
5461 tcaggcagct aaggaaagtt 
5521 ccaattcagt cactgccagc 
5581 tttttctttg gtagtgacac 
5641 tcaataatac atattgtttc 
5701 aatcattgaa aactgctttt 
5761 tcaacgggat catcctcaga ggaactatca 
5821 tcgaatggtg ccacgcccgt gatgagtttt 



tec aacagcacga caagatgcca catactgcgc 
ictagcagt gcgttccctg ccgatgatgt cctgaccagt 
;cacggttt tacgttgaag tggtaaccct gagcacgcag 
iccgtaacg ctgttattgc tgggtaagat caaaaaactt 
laccagaca ttagatgaaa tatttcaacc aattacagca 
;accaaata aatcaagggg ttacatgaaa acgtagcccc 
laaatggat gtagtgtgaa gaataatccc gtttactcaa 
itctacgtt attatctctt tgtaaaaatt gccatttatt 
tgat acaacgggac tagtcacaac aggactattc 
:caaagtcat catccataaa taaaatatca 
ittttattat tacgatcagt caatactcca 
5881 cttaaaccgt tttcgctcac aggttttaat gatttttoat tactcttgtt gtaagcaggc 
5941 gcattaaaaa tacacggagt atcaacatca aacaatgacg ttccccagtt cacatattga 
6001 atatcatagt tactgaagtt ctgtccagaa aagaagcacc ccttaaaatc caatccacgt 
6061 aaattatata aaccaccctc ttctttttgg agagtaatgt taattttggc tatctcccgg 
6121 acatcatcgc catttttgta tttgaatacc gtttcaagat gttctccaga cagcttgact 
6181 tcaggaaata atttgaaatc aaagcctata ttattatgtg tcaacgtaga tgaacaaaaa 
6241 atggaaaaag cttgcagtgc tgaattatag ctatcgattt tatcttgagg ctgagctctt 
6301 ggtaaaaact tatagcaaca ctcataaaaa ttaaacagaa gctctgaaga ccttgtattt 
6361 ttatcaaata ataccccctt gaactcattc acaaacgcat cttttttttc ttgaatatct 
6421 atggggtgtt cctttgactc tgacaaagat gaaatctcat ctattttgtt ttcatatgaa 
6481 ttacgtgatt ccatacagac atttggcggc gtttctaaaa taacactacg cgtactactt 
6541 ggcttaagta aaccaacatg aaaatcactt tttcttatat tatcgaaaag gttctattca 
6601 tttcttttag cgcattcaaa aaactgatcg gcattatttt tattcgataa ttttttagtt 
6661 tcagaaaaca cattttcatt gttttccagc tttagtttaa tgagaagatt ttcccagacc 
6721 tgcttgctta aacatattac gtcaggctca ccagaactaa ctaactcgtg attattatta 
6781 aagtgtacgt tgaatacctt taagttattt tcaccaactt catatttaat acgttttaat 
6841 tgttctccag ctcccataat gacaaaggcg ttgccttctt gatatataca ttcagaoatc 
6901 attttttgta aagtttcaag agcaccgcga tacgtctctg atgcagcttc cttacaaatt 
6961 aattttaaaa tactatgagc tagtagctcc atgtgtttag aagattt--'-- ..4--.,*..-, 4-,, 
7021 cttccactac ccaaaactgc gctagcgtta aatccgttgc ttttac 
7081 tattcctcaa ttaatgtcta catggctatt tttaatgtta ttactgtttg 
7141 aaaacgctca tttgagacaa ttactgacat taactgcttc acttgctacg 
7201 ttaattaaat tagcacagga atgttaaatt taataaacaa aaggttattt 
7261 taaaaaaaac cgttataatt tattagtgaa aatcgttttt caagtgttag 
7321 ctcaatagcg ttggttaatg agcatagcca cgctcctgta acgctcacaa 
7381 cctgcggcgg gtgttctggt cagtagtaga tgtttaaggc gtggcagaga 
7441 ttactctacg gcattgttct acatacattg gttgtggtac 
7501 aacagagaat agttcagtga tttgagtaat taacctgatt 
■ ■ :gttaattac ttaattccac 
:caggtactt ttgaatgagt 
laacaccagc cctcattcga aac' 
;gaaatatag accgccatcg agt 
tgatttggt ggcccttgct gga( 



: taacattgtc 



7561 tgataatact ctggctttat 
7621 tggcatagca ggcatttttt 
7681 cagtggcgtg ccacgtacca 
7741 ccttgaaatg gcgttagtca 
7801 ctcttcctga tacgtaaati 
7861 gcgattatga gtcgcctgc 
7921 tacttataaa ccacgcaat. 
7981 gttcccattc accttgggti 
8041 gcaaagtcga tactcgcagt 
8101 catcgattca atgatacttt gt 

8161 gctgaagagt tttcgcatca ttatgatatc tgttactttt 
8221 cgtatagcaa tatattcttt catagatctt attctgctaa 
8281 atttttgaca gttgcctctc cagaccacat cgataattaa 
8341 tccttttcca tataaatatt ggataaaagt aggacatctg 
8401 aattaaacat ttttatgttt ccgtatacat catattactc 
8461 tatgttttct ataaaaccag gacccagaaa tttacctatc 
8521 atggaacatt actgacgggg atctaaattc caaattaaat 
8581 tataacaaat attattaatt cttgtggagt t 
8641 tatatcaact tttcacgcag 
8701 ttccctttct ccagatacaa 
8761 agcgccagac aggataggag 
8821 aagaggtgat aacagcggtg 
8881 atctgacaaa ccttccccaa 
8941 tgagctggca gccgatgtat 



aactcatctg 

tcagtgagcg 
aaatgaaggg gtataataaa 
atgtaagcaa tttgcccgct 
actgatggat aaatacattg 
cccac cgcacttctt 

tgaac cagcgaccaa 
tgagtgt gcaataacaa 

acccgacacc gcctccattc 
ttagcactat ctatagttat 
aatataactt tattacatta 



: ctaaccactg agctaaaggg 
L aacatgatga tcatatgatg 
: gtgattacac gcgacataaa 
;aacaagcaa atgttaataa 



tcattagttt cgtatgagcg 

taaaacagat ttaagcatta 

tttgcaatta ctttcacaac 

taccattaga ggaactttat 

gacaacccca cattgttatc 

acattagaat atctaaactg 

ggattaaaag acagagaaat 

ttaaaaaacg attataaaat 

gcagcacagc gtagcattac 

aagaaaaac ataggtttat caacgagata caatgattga 

ttatcagta tggtcgtgct gaacatttca cccacattat 

agataaata tgttgcatat gctattaaca ttcctgacta 

cgtgacgtca ccttccggac agcaagaaac 
9001 atttaaaata ttaatcaatc tagaacatct acggcaaaca cttgaacgta aatctcttac 
9061 tgctgttcag aaatcacaat gtgaaatcat cacccccaaa aaacctggcg aagcgattct 
9121 tcatgctttt aatgccacct accagcagat cagagaaaat atgtctgaat ttgcacgttg 
9181 ccattatggg tatatacaaa tccctccagt gacaactttc cgcgtccgac ggaccagaaa 
9241 ctcccgaaga agaaaagggt tactggtttc atgcttatca acccgaagat ctttgtacca 
9301 tccataatcc aatgggagat ttgcaggatt ttattgcatt ggttaaagat gctaaaaaat 
9361 ttggtatcga tatcattcct gattatacct ttaactttat gggaattggg ggtagtggta 
9421 aaaatgacct ggattatccc tctgctgata tacgagcgaa gatcagtaaa gatatagaag 
9481 gtggtatccc tggctattag caaggtcagg ttttgattcc attcattaaa gatccagtaa 
9541 caaaagaacg taaacaaatc catccagaag atatacatct cactgcaaaa gacttcgaag 
9601 caagtaaaga taacatctct aaggatgaat gggaaaacct ccatgcatta aaagaaaagc 
9661 gtttaaatgg aatgcctaaa acaacaccca aaagtgacca ggttattatg ttgcaaaatc 
9721 aatacgttcg tgaaatgcga aaatatggcg tacgaggttt acgttatgat gcggcaaaac 
9781 actcaaaaca tgaacaaata gaaagatcaa taaccccacc gcttaaaaat tataatgagc 
9841 ggttacacaa tactaactta tttaacccaa aatatcataa aaaagccgtt atgaattaca 
9901 tggaatatct ggtaacttgt cagttggatg aacaacaaat gtcatcactg ctttatgaaa 
9961 gagatgattt aagcgccatt gatttttcat tgctcatgaa aacgataaaa gccttttcat 
10021 ttggtggaga tcttcaaacc cttgcatcaa aaccgggttc cacaatctca agtatcccat 
10081 cagaaagacg gatattgatt aacattaacc acgattttcc taacaatggt aatcttttca 
10141 atgactttct atttaaccat caacaagatg aacaattagc aatggcatat atagccgctc 
10201 tgccgttcag caggccttta gtttactggg atggccaagt attaaaatca acgactgaaa 
10261 ttaaaaatta tgatgggtcc acgcgtgtcg gcggtgaggc gtagcttaat aaaggttgct 
10321 ctacctatca gcagctctac aatgaattcc acgcattata tatagataaa gcaggaatat 
10381 ggagcgcatt tgagggtgta t 
LOCUS       ECOTGOP                 1100 bp    DNA     linear   BCT 18-APR-1994
DEFINITION  E.coli tRNA operon: supB-E tRNA gene cluster coding for 7 tRNAs.
ACCESSION   J01713
VERSION     J01713.1  GI:147952
KEYWORDS    glnU gene; glnV gene; leuW gene; metT gene; supB region; supB-E
            transfer RNA operon; supE region; transfer RNA; transfer RNA-Gln;
            transfer RNA-Leu; transfer RNA-Met.
SOURCE      Escherichia coli
  ORGANISM  Escherichia coli
            Bacteria; Proteobacteria; Gammaproteobacteria; Enterobacteriales;
            Enterobacteriaceae; Escherichia.
REFERENCE   1  (bases 1 to 1100)
  AUTHORS   Nakajima,N., Ozeki,H. and Shimura,Y.
  TITLE     Organization and structure of an E. coli tRNA operon containing
            seven tRNA genes
  JOURNAL   Cell 23 (1), 239-249 (1981)
   PUBMED   6163550
REFERENCE   2  (bases 897 to 899)
  AUTHORS   Nakajima,N., Ozeki,H. and Shimura,Y.
  TITLE     Organization and structure of an E. coli tRNA operon containing
            seven tRNA genes
  JOURNAL   Cell 25, 581-581 (1981)

REFERENCE   3  (sites)
  AUTHORS   Nakajima,N., Ozeki,H. and Shimura,Y.
  TITLE     In vitro transcription of the supB-E tRNA operon of Escherichia
            [...] of transcription products
  JOURNAL   [...] 11113-11120 (1982)
            I66S2
REFERENCE   4  (bases 1 to 1100)
  AUTHORS   Ooi,T.
  JOURNAL   Unpublished (1983)
COMMENT     Original source text: Escherichia coli DNA.
            [2] in corrigendum.
            [3] sites; transcription initiation and termination. [4] sent on
            tape from Kyoto University. Entry kindly reviewed by H.Ozeki
            (29-AUG-1984). The second tRNA, previously called 'X-tRNA', has
            been tentatively identified as Leu-tRNA.

            All seven tRNAs are found on one primary transcript in vitro [1].
            The DNA (bp 895-920) could form a hairpin structure of 12 bp, which
            is consistent with rho-independent termination sites. In the RNA
            termination site 'ctt' (bp 920-922) the first 't' is preferred
            (70%), the 'c' is second (22%) and the second 't' accounts for 5%
            of the RNA terminations [3].

FEATURES             Location/Qualifiers
     source          1..1100
                     /organism="Escherichia coli"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:562"
     tRNA            270..354
                     /product="tRNA-Leu"
                     /note="codon recognized: CUA"
     tRNA            378..452
                     /product="tRNA-Gln"
                     /note="codon recognized: CAA"
     tRNA            487..561
                     /product="tRNA-Gln"
                     /note="codon recognized: CAA"
     tRNA            577..653
                     /product="tRNA-Met"
                     /note="codon recognized: AUG"
     tRNA            701..775
                     /product="tRNA-Gln"
                     /note="codon recognized: CAG"
     tRNA            813..887
                     /product="tRNA-Gln"
                     /note="codon recognized: CAG"
ORIGIN 113 bp upstream of Hindi site; 15.5 min on K12 map. 

1 atggctgtag ggtgggaaaa aacacaaagt aaacaataat tgacgaatat agcgccacgc 
61 tgttcgcaac ctaaccaaac agtcactttc gagcaatttt ccttgaaaaa gaggttgacg 
121 ctgcaaggct ctatacgcat aatgcgcccc gcaacgccga taaggtatcg cgaaaaaaaa 
181 gatggctacg tagctcagtt ggttagagca catcactcat aatgatgggg tcacaggttc 
241 gaatcccgtc gtagccacca tctttttttg cgggagtggc gaaattggta gacgcaccag 
301 atttaggttc tggcgccgca aggtgtgcga gttcaagtct cgcctcccgc accattcacc 
361 agaaagcgtt gtacggatgg ggtatcgcca agcggtaagg caccggtttt tgataccggc 
421 attccctggt tcgaatccag gtaccccagc catcttcttc gagtaagcgg ttcaccgccc 
481 ggttattggg gtatcgccaa gcggtaaggc accggttttt gataccggca ttccctggtt 
541 cgaatccagg taccccagcc atcgaagaaa caatctggct acgtagctca gttggttaga 
601 gcacatcact cataatgatg gggtcacagg ttcgaatccc gtcgtagcca ccaaattctg 
661 aatgtatcga atatgttcgg caaattcaaa accaatttgt tggggtatcg ccaagcggta 
721 aggcaccgga ttctgattcc ggcattccga ggttcgaatc ctcgtacccc agccaattta 
781 ttcaagacgc ttaccttgta agtgcaccca gttggggtat cgccaagcgg taaggcaccg 
841 gattctgatt ccggcattcc gaggttcgaa tcctcgtacc ccagccacat taaaaaagct 
901 cgcttcggcg agctttttgc ttttctgcgt tcattcaatg tcgaatgcga tgttgacacg 
961 tcttatcctt caatgtcgga tgcgacgctg ccgcgtctta tccgacctac ggttggcacg 
1021 catccggcaa tgttgtaagg ctacaaccct aacgcatatt tcagcgcctg acgtttcaac 
1081 acgccagcac gctccgccgc 
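
The feature locations in the J01713 record above (for example 378..452 and 487..561) are 1-based and inclusive, so they can be checked directly against the ORIGIN lines. The following is a minimal sketch of that check, not part of the filed record: the function names `parse_origin` and `feature_seq` are hypothetical, and only four ORIGIN lines are copied in as an excerpt.

```python
def parse_origin(lines):
    """Return (first_position, sequence) from GenBank ORIGIN-style lines.

    Each line starts with the 1-based position of its first base,
    followed by blocks of ten bases separated by spaces.
    """
    start = None
    parts = []
    for line in lines:
        fields = line.split()
        if not fields:
            continue
        if start is None:
            start = int(fields[0])
        parts.append("".join(fields[1:]))
    return start, "".join(parts)


def feature_seq(start, seq, first, last):
    """Slice a 1-based, inclusive location such as 378..452."""
    return seq[first - start:last - start + 1]


# Excerpt of the ORIGIN block of J01713 as printed above (positions 361-600).
ORIGIN_EXCERPT = [
    "361 agaaagcgtt gtacggatgg ggtatcgcca agcggtaagg caccggtttt tgataccggc",
    "421 attccctggt tcgaatccag gtaccccagc catcttcttc gagtaagcgg ttcaccgccc",
    "481 ggttattggg gtatcgccaa gcggtaaggc accggttttt gataccggca ttccctggtt",
    "541 cgaatccagg taccccagcc atcgaagaaa caatctggct acgtagctca gttggttaga",
]

start, seq = parse_origin(ORIGIN_EXCERPT)
copy_a = feature_seq(start, seq, 378, 452)  # first tRNA-Gln feature
copy_b = feature_seq(start, seq, 487, 561)  # second tRNA-Gln feature

print(len(copy_a), len(copy_b))
print(copy_a == copy_b)
```

Within this excerpt the two tRNA-Gln features are each 75 bases long and have the same sequence, which is what one would expect for two copies of the same gene.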
ATTACHMENT D 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Application of: Carstens, Carsten-Peter 
Serial No.: 09/492,590 
Examiner: G. Leffers Jr. 
Filed: January 27, 2000 
Group: 1636 
Entitled: METHODS AND COMPOSITIONS FOR HIGH LEVEL EXPRESSION OF A 
HETEROLOGOUS PROTEIN WITH POOR CODON USAGE 
Attorney Docket No.: 25436/1340 [Formerly 4114/85530] 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

RULE 132 DECLARATION OF MARY BUCHANAN 

I, Mary Buchanan, hereby declare that: 

1. I am employed by Stratagene, the assignee of record of the above-referenced 
patent application. I hold the position of Director of Product Management at Stratagene. 

2. Stratagene's CodonPlus™ products contain tRNA genes corresponding to rarely 
used codons. Stratagene presently sells six different BL21-CodonPlus™ bacterial strains for use 
in enhancing the expression of protein encoded by genes containing rarely used codons. Bacterial 
strains BL21-CodonPlus™ RIL (Catalog # 230240), BL21-CodonPlus™ (DE3)-RIL (Catalog # 
230245), and BL21-CodonPlus (DE3)-RIL-X (Catalog # 230265), are embodiments of the 
invention claimed in claims 1-12 and 37. Each of the above competent cell products contains 
three tRNA genes corresponding to rarely used codons consisting of argU, ileY, and leuW. 
Bacterial strains BL21-CodonPlus™ RP (Catalog # 230250), BL21-CodonPlus™ (DE3)-RP 
(Catalog # 230255), and BL21-CodonPlus (DE3)-RP-X (Catalog # 230275), are embodiments of 
the invention claimed in claims 39-41 and 44. Each of the above competent cell products 
contains two tRNA genes corresponding to rarely used codons consisting of argU and proL. 

CodonPlus™ host cell products were the first commercially available compositions to 
successfully allow high level expression of genes encoding rarely used codons. Sales figures 
provided below indicate the commercial success of these embodiments of the claimed invention 
(Table I). The products became available in May 1999 in the United States and continue to sell 
today. 



Table I 

Year  Catalog #  Product                                     Number of   Date        Total 
                                                             tRNA genes  Introduced  Sales 

1999  230240     BL21-CodonPlus™-RIL Competent Cells         3           5/24/99     $103,152 
1999  230245     BL21-CodonPlus™(DE3)-RIL Competent Cells    3                       325,143 
1999  230250     BL21-CodonPlus™-RP Competent Cells          2           8/27/99     18,764 
1999  230255     BL21-CodonPlus™(DE3)-RP Competent Cells     2           8/27/99     46,052 
                                                             1999 Total              493,111 

2000  230240     BL21-CodonPlus™-RIL Competent Cells         3           5/24/99     59,265 
2000  230245     BL21-CodonPlus™(DE3)-RIL Competent Cells    3                       226,181 
2000  230250     BL21-CodonPlus™-RP Competent Cells          2           8/27/99     51,210 
2000  230255     BL21-CodonPlus™(DE3)-RP Competent Cells     2           8/27/99     144,118 
2000  230265     BL21-CodonPlus™(DE3)-RIL-X Competent Cells              1/14/00     5J17 
2000  230275     BL21-CodonPlus™(DE3)-RP-X Competent Cells   2           1/14/00     3,429 
                                                             2000 Total              489^ 

2001  230240     BL21-CodonPlus™-RIL Competent Cells         3           5/24/99     5,174 
2001  230245     BL21-CodonPlus™(DE3)-RIL Competent Cells    3           5/24/99     17,230 
2001  230250     BL21-CodonPlus™-RP Competent Cells          2           8/27/99     3,377 
2001  230255     BL21-CodonPlus™(DE3)-RP Competent Cells     2                       15,610 
2001  230265     BL21-CodonPlus™(DE3)-RIL-X Competent Cells  3           1/14/00     162 
2001  230275     BL21-CodonPlus™(DE3)-RP-X Competent Cells   2           1/14/00     162 
                                                             2001 Total              41,715 

                                                             TOTAL: 





The primary reason one would use the CodonPlus™ series host cells is to increase the 
expression of genes containing low frequency codons. The CodonPlus™ series host cells are 
sold at $195 per unit, which is 44.4% higher than the same competent cells without the array of 
tRNA genes (BL21, Catalog #200133 and BL21 (DE3), Catalog #200131). Essentially all of 
the customers who purchased the CodonPlus™ cells used them for the purpose of increasing the 
expression of heterologous genes encoding rarely used codons because there is no reason to use 
the more expensive CodonPlus host strains for routine expression of genes without low 
frequency codon problems. Therefore, the commercial success of the CodonPlus™ series host 
cells is due to the presence of an array of two or three tRNA genes encoding rarely used codons. 

3. The market for bacterial host cells is shared by a number of major companies, 
including Invitrogen, Promega, Novagen, and Life Technologies. The consumer is thus free to 
choose a product on the basis of objective principles. The commercial success Stratagene 
enjoyed in selling CodonPlus™ host cells is not the result of heavy promotion or advertising, nor 
the result of consumption by purchasers normally tied to Stratagene. Stratagene spent no more 
on promotion and advertising of the CodonPlus™ host cells than it did on any of the other new 
competent cell products it sells. For example, XL10-Gold® Ultracompetent cells (Catalog Nos. 
200314, 200315 and 200317) were released in May 1997. Sales in 1997 were $188,474 
worldwide, $592,193 in 1998. Launch costs were similar for both products. 

4. I hereby declare that all statements made herein of my own knowledge are true, 
and that all statements made on information and belief are believed to be true; and further, that 
these statements were made with the knowledge that willful, false statements and the like so 
made are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the 
United States Code, and that such willful false statements may jeopardize the validity of the 
application or any patent issuing thereon. 



Febmarv S. 2001 




Date 



Director of Product Management 
Stratagene 
11011 North Torrey Pines Road 
La Jolla, CA 92037 
ATTACHMENT E 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Express Mail Label No.: EL3269231S5US 
Date of Deposit: November 9, 2001 

Application of: Carstens, Carsten-Peter 
Serial No.: 09/492,590 
Filed: January 27, 2000 
Entitled: METHODS AND COMPOSITIONS FOR HIGH LEVEL EXPRESSION OF A 
HETEROLOGOUS PROTEIN WITH POOR CODON USAGE 
Examiner: G. Leffers Jr. 
Group: 1636 
Attorney Docket No.: 25436/1340 [Formerly 4114/85530] 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

SUPPLEMENTAL DECLARATION OF MARY BUCHANAN 

I, Mary Buchanan, hereby declare, supplemental to the Rule 1.132 Declaration filed on 
February 9, 2001, that: 

1. I am employed by Stratagene, the assignee of record of the above-referenced 
patent application. I hold the position of Director of Product Management at Stratagene. 

2. I understand that the Examiner has raised a question as to the relative level of 
sales of related products available in the marketplace upon which to base a comparison of the 
commercial success of the claimed host cell strains embodied in Stratagene's CodonPlus™ 
product line. The sales figures presented in my prior declaration encompassed Stratagene's sales 
over approximately 20 months from the first offer of sale in May of 1999 (BL21-CodonPlus™-RIL 
Competent Cells) to shortly before February 9, 2001. To my knowledge, prior to 2001, 
Stratagene held 100% of the market share for codon-enhanced host cells because there were no 
other commercially available host cell strains containing recombinant genes for tRNAs specific 
for low-frequency codons (Novagen entered the market in early 2001 with their Rosetta™ 
strains). There would be no reason to buy or use the more expensive CodonPlus™ strains for the 
expression of genes without codon bias problems, so comparison of sales figures from other 
expression systems that are not codon optimized would not provide a meaningful background 
against which to determine commercial success. 

3. I hereby declare that all statements made herein of my own knowledge are true, and that 
all statements made on information and belief are believed to be true; and further, that these 
statements were made with the knowledge that willful, false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereon. 

Date                                    Mary Buchanan 

Director of Product Management 
Stratagene 
11011 North Torrey Pines Road 
La Jolla, CA 92037 



